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Preface 


Physical and natural phenomena depend on a complex array of factors. The sociol- 
ogist or psychologist who studies group behavior, the economist who endeavors 
to understand the vagaries of a nation’s employment cycles, the physicist who 
observes the trajectory of a particle or planet, or indeed anyone who seeks to 
understand geometry in two, three, or more dimensions recognizes the need to 
analyze changing quantities that depend on more than a single variable. Vec- 
tor calculus is the essential mathematical tool for such analysis. Moreover, it 
is an exciting and beautiful subject in its own right, a true adventure in many 
dimensions. 

The only technical prerequisite for this text, which is intended for a 
sophomore-level course in multivariable calculus, is a standard course in the cal- 
culus of functions of one variable. In particular, the necessary matrix arithmetic 
and algebra (not linear algebra) are developed as needed. Although the mathe- 
matical background assumed is not exceptional, the reader will still be challenged 
in places. 

My own objectives in writing the book are simple ones: to develop in students 
a sound conceptual grasp of vector calculus and to help them begin the transition 
from first-year calculus to more advanced technical mathematics. I maintain that 
the first goal can be met, at least in part, through the use of vector and matrix 
notation, so that many results, especially those of differential calculus, can be 
stated with reasonable levels of clarity and generality. Properly described, results 
in the calculus of several variables can look quite similar to those of the calculus 
of one variable. Reasoning by analogy will thus be an important pedagogical tool. 
I also believe that a conceptual understanding of mathematics can be obtained 
through the development of a good geometric intuition. Although I state many 
results in the case of n variables (where n is arbitrary), I recognize that the most 
important and motivational examples usually arise for functions of two and three 
variables, so these concrete and visual situations are emphasized to explicate the 
general theory. Vector calculus is in many ways an ideal subject for students 
to begin exploration of the interrelations among analysis, geometry, and matrix 
algebra. 

Multivariable calculus, for many students, represents the beginning of signif- 
icant mathematical maturation. Consequently, I have written a rather expansive 
text so that they can see that there is a story behind the results, techniques, and 
examples—that the subject coheres and that this coherence is important for prob- 
lem solving. To indicate some of the power of the methods introduced, a number 
of topics, not always discussed very fully in a first multivariable calculus course, 
are treated here in some detail: 


* an early introduction of cylindrical and spherical coordinates ($1.7); 

e the use of vector techniques to derive Kepler’s laws of planetary motion 
(83.1); 

* the elementary differential geometry of curves in R3, including discussion 
of curvature, torsion, and the Frenet—Serret formulas for the moving frame 


(83.2), 


e Taylor’s formula for functions of several variables ($4.1); 
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e the use of the Hessian matrix to determine the nature (as local extrema) of 
critical points of functions of n variables (§4.2 and $4.3); 


e an extended discussion of the change of variables formula in double and triple 
integrals (§5.5); 


° applications of vector analysis to physics (§7.4); 


e an introduction to differential forms and the generalized Stokes’s theorem 
(Chapter 8). 


Included are a number of proofs of important results. The more techni- 
cal proofs are collected as addenda at the ends of the appropriate sections so 
as not to disrupt the main conceptual flow and to allow for greater flexibility 
of use by the instructor and student. Nonetheless, some proofs (or sketches of 
proofs) embody such central ideas that they are included in the main body of the 
text. 


New in the Fourth Edition 


I have retained the overall structure and tone of prior editions. New features in 
this edition include the following: 


e 210 additional exercises, at all levels; 


e a new, optional section (§5.7) on numerical methods for approximating 
multiple integrals; 


e reorganization of the material on Newton’s method for approximating 
solutions to systems of n equations in n unknowns to its own (optional) 
section (§2.7); 


e new proofs in Chapter 2 of limit properties (in §2.2) and of the general 
multivariable chain rule (Theorem 5.3 in §2.5); 


e proofs of both single-variable and multivariable versions of Taylor’s theorem 
in §4.1; 

e various additional refinements and clarifications throughout the text, 
including many new and revised examples and explanations; 


* new Microsoft® PowerPoint® files and Wolfram Mathematica® notebooks 
that coordinate with the text and that instructors may use in their teaching 
(see “Ancillary Materials” below). 


How to Use This Book 


There is more material in this book than can be covered comfortably during a single 
semester. Hence, the instructor will wish to eliminate some topics or subtopics—or 
to abbreviate the rather leisurely presentations of limits and differentiability. Since 
I frequently find myself without the time to treat surface integrals in detail, I have 
separated all material concerning parametrized surfaces, surface integrals, and 
Stokes’s and Gauss’s theorems (Chapter 7), from that concerning line integrals 
and Green’s theorem (Chapter 6). In particular, in a one-semester course for 
students having little or no experience with vectors or matrices, instructors can 
probably expect to cover most of the material in Chapters 1—6, although no doubt 
it will be necessary to omit some of the optional subsections and to downplay 
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many of the proofs of results. A rough outline for such a course, allowing for 
some instructor discretion, could be the following: 


Chapter 1 8—9 lectures 
Chapter 2 9 lectures 
Chapter 3 4-5 lectures 
Chapter 4 5—6 lectures 
Chapter 5 8 lectures 
Chapter 6 4 lectures 


38-41 lectures 


If students have a richer background (so that much of the material in Chapter 1 
can be left largely to them to read on their own), then it should be possible to treat 
a good portion of Chapter 7 as well. For a two-quarter or two-semester course, 
it should be possible to work through the entire book with reasonable care and 
rigor, although coverage of Chapter 8 should depend on students’ exposure to 
introductory linear algebra, as somewhat more sophistication 1s assumed there. 

The exercises vary from relatively routine computations to more challenging 
and provocative problems, generally (but not invariably) increasing in difficulty 
within each section. In a number of instances, groups of problems serve to intro- 
duce supplementary topics or new applications. Each chapter concludes with a 
set of miscellaneous exercises that both review and extend the ideas introduced 
in the chapter. 

A word about the use of technology. The text was written without reference 
to any particular computer software or graphing calculator. Most of the exercises 
can be solved by hand, although there is no reason not to turn over some of the 
more tedious calculations to a computer. Those exercises that require a computer 
for computational or graphical purposes are marked with the symbol Q and 
should be amenable to software such as Mathematica®, Maple®, or MATLAB. 


Ancillary Materials 


In addition to this text a Student Solutions Manual is available. An Instructor’s 
Solutions Manual, containing complete solutions to all of the exercises, is 
available to course instructors from the Pearson Instructor Resource Center 
(www.pearsonhighered.com/irc), as are many Microsoft® PowerPoint® files and 
Wolfram Mathematica® notebooks that can be adapted for classroom use. The 
reader can find errata for the text and accompanying solutions manuals at the 
following address: 

www.oberlin.edu/math/faculty/colley/VCErrata.html 
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Figure 2 The coordinate plane R°. 


To the Student: 
Some Preliminary 


Notation 


Here are the ideas that you need to keep in mind as you read this book and learn 
vector calculus. 


Given two sets A and B, I assume that you are familiar with the notation 
A U B for the union of A and B—those elements that are in either A or B (or 
both): 


AUB={x|x € Aorxe B}. 


Similarly, A N B is used to denote the intersection of A and B—those elements 
that are in both A and B: 


AN B= {x|x € Aandx €B}. 


The notation A C B, or A C B, indicates that A is a subset of B (possibly empty 
or equal to B). 


One-dimensional space (also called the real line or R) is just a straight line. 
We put real number coordinates on this line by placing negative numbers on the 
left and positive numbers on the right. (See Figure 1.) 

Two-dimensional space, denoted R’, is the familiar Cartesian plane. If we 
construct two perpendicular lines (the x- and y-coordinate axes), set the origin 
as the point of intersection of the axes, and establish numerical scales on these 
lines, then we may locate a point in R? by giving an ordered pair of numbers (x, y), 
the coordinates of the point. Note that the coordinate axes divide the plane into 
four quadrants. (See Figure 2.) 

Three-dimensional space, denoted R?, requires three mutually perpendicular 
coordinate axes (called the x-, y- and z-axes) that meet in a single point (called 
the origin) in order to locate an arbitrary point. Analogous to the case of R’, if we 
establish scales on the axes, then we can locate a point in R? by giving an ordered 
triple of numbers (x, y, z). The coordinate axes divide three-dimensional space 
into eight octants. It takes some practice to get your sense of perspective correct 
when sketching points in R°. (See Figure 3.) Sometimes we draw the coordinate 
axes in R? in different orientations in order to get a better view of things. However, 
we always maintain the axes in a right-handed configuration. This means that 
if you curl the fingers of your right hand from the positive x-axis to the positive 
y-axis, then your thumb will point along the positive z-axis. (See Figure 4.) 

Although you need to recall particular techniques and methods from the 
calculus you have already learned, here are some of the more important concepts 
to keep in mind: Given a function f(x), the derivative f’(x) is the limit (if it exists) 
of the difference quotient of the function: 


fa +h)— f@) 
3 : 


fœ) = lim 


XVI To the Student: Some Preliminary Notation 


lah OF 


Figure 3 Three-dimensional Figure 4 The x-, y-, and z-axes in R? are always 
space R°. Selected points are drawn in a right-handed configuration. 
y graphed. 


(xo, F(%o)) 


Figure 5 The derivative f’(xo) is 
the slope of the tangent line to 


y = f(x) at (xo, f(x0)). 


The significance of the derivative f’(xq) is that it measures the slope of the line 
tangent to the graph of f at the point (xo, f(xo)). (See Figure 5.) The derivative 
may also be considered to give the instantaneous rate of change of f at x = xo. 
We also denote the derivative f'(x) by df/dx. 

The definite integral i f(x)dx of f on the closed interval [a, b] is the limit 
(provided it exists) of the so-called Riemann sums of /: 


b n 
J f(x)dx = lim oe FAX. 


all Ax;—> 


Here a = x9 < xı < X2 < +--+ < x, = b denotes a partition of [a, b] into subin- 
tervals [x;_1, x;], the symbol Ax; = x; — x;_, (the length of the subinterval), and 
x; denotes any point in [x;-1, x;]. If f(x) > 0 on [a, b], then each term f(x¥)Ax; 
in the Riemann sum is the area of a rectangle related to the graph of f. The 
Riemann sum )~y_, f(x*)Ax; thus approximates the total area under the graph 
of f between x = a and x = b. (See Figure 6.) 


| 

l 

i 

| 
a X X Xzsej e Xi-1! Xi e Xn-1b 

l 

a 
Figure 6 If f(x) > 0 on [a, b], then the Riemann sum 
approximates the area under y = f(x) by giving the sum 
of areas of rectangles. 
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Figure 7 The area under the graph of y = f(x) is 
SÉ f@dx. 


The definite integral J , f(x)dx, if it exists, is taken to represent the area 
under y = f(x) between x = a and x = b. (See Figure 7.) 

The derivative and the definite integral are connected by an elegant result 
known as the fundamental theorem of calculus. Let f(x) be a continuous func- 
tion of one variable, and let F(x) be such that F’(x) = f(x). (The function F is 
called an antiderivative of f.) Then 


b 

1. f(x) dx = F(b) — F(a); 
d x 

2 | f(t)dt = f(x). 


Finally, the end of an example is denoted by the symbol and the end of a 
proof by the symbol E. 
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- Vectors 


1.1 Vectors in Two and Three Dimensions 


For your study of the calculus of several variables, the notion of a vector is 
fundamental. As is the case for many of the concepts we shall explore, there are 
both algebraic and geometric points of view. You should become comfortable 
with both perspectives in order to solve problems effectively and to build on your 
basic understanding of the subject. 


Vectors in R’ and R°: The Algebraic Notion 


DEFINITION 1.1 A vector in R? is simply an ordered pair of real numbers. 
That is, a vector in R? may be written as 


(a;,a2) (e.g., (1, 2) or (r, 17)). 
Similarly, a vector in R? is simply an ordered triple of real numbers. That is, 
a vector in R? may be written as 


(a, a2, a3) (e.g., (T; e, /2)). 


To emphasize that we want to consider the pair or triple of numbers as a 
single unit, we will use boldface letters; hence a = (a), a2) or a = (a1, a2, a3) 
will be our standard notation for vectors in R? or R°. Whether we mean that a is a 
vector in R? or in R? will be clear from context (or else won’t be important to the 
discussion). When doing handwritten work, it is difficult to “boldface” anything, 
so you’ll want to put an arrow over the letter. Thus, a will mean the same thing 
as a. Whatever notation you decide to use, it’s important that you distinguish the 
vector a (or 4) from the single real number a. To contrast them with vectors, we 
will also refer to single real numbers as scalars. 

In order to do anything interesting with vectors, it’s necessary to develop 
some arithmetic operations for working with them. Before doing this, however, 
we need to know when two vectors are equal. 


DEFINITION 1.2 Two vectors a = (a, a2) and b = (by, b2) in R? are 
equal if their corresponding components are equal, that is, if aj = bı and 
dz = bz. The same definition holds for vectors in RÌ: a = (aj, a2, a3) and 
b = (bı, b2, b3) are equal if their corresponding components are equal, that 
is, if ay = (Dilo d = bo, and d3 = b3. 
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EXAMPLE 1 The vectors a = (1, 2) and b = (3, $) are equal in R?, but c = 
(1, 2,3) and d = (2, 3, 1) are not equal in R°. + 


Next, we discuss the operations of vector addition and scalar multiplication. 
We’ll do this by considering vectors in R? only; exactly the same remarks will 
hold for vectors in R? if we simply ignore the last component. 


DEFINITION 1.3 (VECTOR ADDITION) Let a = (a, a, a3) and b = (b, 
bz, b3) be two vectors in R°. Then the vector sum a + b is the vector in R? 
obtained via componentwise addition: a + b = (a; + b1, a2 + bo, a3 + b3). 


EXAMPLE 2 We have (0, 1, 3) + (7, —2, 10) = (7, —1, 13) and (in R2): 
(1, 1)+ (a, V2) = +x, 1+2. + 


Properties of vector addition. We have 
1. a+b = b + a for all a, b in R? (commutativity); 
2. a+ (b+ c¢)= (a + b) + c for all a, b, ¢ in R? (associativity); 


3. a special vector, denoted 0 (and called the zero vector), with the property 
that a + 0 = a for all ain R°. 


These three properties require proofs, which, like most facts involving the al- 
gebra of vectors, can be obtained by explicitly writing out the vector components. 
For example, for property 1, we have that if 

a= (a, a2, a3) and b= (bı, bz, b3), 
then 


a+b = (a; + bı, a + b2, a3 + b3) 
= (bı Gj; bz a2, b3 a3) 
=b+a, 


since real number addition is commutative. For property 3, the “special vector” 
is just the vector whose components are all zero: 0 = (0, 0, 0). It’s then easy to 
check that property 3 holds by writing out components. Similarly for property 2, 
so we leave the details as exercises. 


DEFINITION 1.4 (SCALAR MULTIPLICATION) Let a= (a1, a2, a3) be a vec- 
tor in R? and let k € R be a scalar (real number). Then the scalar prod- 
uct ka is the vector in R? given by multiplying each component of a by 
kaka = (kaj, Ka, ka3). 


EXAMPLE 3 Ifa = (2,0, V2) and k = 7, then ka = (14, 0, 7/2). + 


The results that follow are not difficult to check—just write out the vector 
components. 


© (a, ay) 


Figure 1.1 A vector a € R? 
corresponds to a point in R°. 


bd (a, 42, a3) 


Xx 


Figure 1.2 A vector a € R? 
corresponds to a point in R°. 
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Properties of scalar multiplication. For all vectors a and b in R? (or R?) 
and scalars k and / in R, we have 

1. (kK+))a=ka+la (distributivity); 

2. k(a+b)=ka+kb (distributivity); 

3. k(la) = (kl)a = I(ka). 


It is worth remarking that none of these definitions or properties really de- 
pends on dimension, that is, on the number of components. Therefore we could 
have introduced the algebraic concept of a vector in R” as an ordered n-tuple 
(a1, a2, ..., an) of real numbers and defined addition and scalar multiplication 
in a way analogous to what we did for R? and R?. Think about what such a 
generalization means. We will discuss some of the technicalities involved in §1.6. 


Vectors in R’ and R’: The Geometric Notion 


Although the algebra of vectors is certainly important and you should become 
adept at working algebraically, the formal definitions and properties tend to present 
arather sterile picture of vectors. A better motivation for the definitions just given 
comes from geometry. We explore this geometry now. First of all, the fact that 
a vector a in R? is a pair of real numbers (a1, a2) should make you think of the 
coordinates of a point in R?. (See Figure 1.1.) Similarly, if a € R3, then a may 
be written as (a1, a2, a3), and this triple of numbers may be thought of as the 
coordinates of a point in R°. (See Figure 1.2.) 

All of this is fine, but the results of performing vector addition or scalar mul- 
tiplication don’t have very interesting or meaningful geometric interpretations in 
terms of points. As we shall see, it is better to visualize a vector in R? or R? as an 
arrow that begins at the origin and ends at the point. (See Figure 1.3.) Such a depic- 
tion is often referred to as the position vector of the point (a), a2) or (a1, a2, a3). 

If you’ve studied vectors in physics, you have heard them described as objects 
having “magnitude and direction.” Figure 1.3 demonstrates this concept, provided 
that we take “magnitude” to mean “length of the arrow” and “direction” to be the 
orientation or sense of the arrow. (Note: There is an exception to this approach, 
namely, the zero vector. The zero vector just sits at the origin, like a point, and has 
no magnitude and, therefore, an indeterminate direction. This exception will not 
pose much difficulty.) However, in physics, one doesn’t demand that all vectors 


In R? In R? 


y z 


(a, 4), a3) 


(a4, a) 


x 


Figure 1.3 A vector a in R? or R° is represented by an arrow from the 
origin to a. 
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Figure 1.6 The vector 

a + b may be represented 
by an arrow whose tail is at 
the tail of a and whose head 
is at the head of b. 


be represented by arrows having their tails bound to the origin. One is free to 
“parallel translate” vectors throughout R? and R°. That is, one may represent 
the vector a = (a1, a2, a3) by an arrow with its tail at the origin (and its head at 
(a, a2, 43)) or with its tail at any other point, so long as the length and sense of 
the arrow are not disturbed. (See Figure 1.4.) For example, if we wish to represent 
a by an arrow with its tail at the point (x1, x2, x3), then the head of the arrow 
would be at the point (x; + a1, x2 + a2, x3 + a3). (See Figure 1.5.) 


Zz 
A a 
Zz 
a, M, a 
(41, a, a3) (x1 +44, X2 + dy, X3 + a3) 
a a a a 
(x1, X2, x3) 
y 
x a 
x 

Figure 1.4 Each arrow is a Figure 1.5 The vector 
parallel translate of the position a = (a1, a, a3) represented by an 
vector of the point (a1, a2, a3) and arrow with tail at the point 
represents the same vector. (x1, X2, X3). 


With this geometric description of vectors, vector addition can be visualized 
in two ways. The first is often referred to as the “head-to-tail” method for adding 
vectors. Draw the two vectors a and b to be added so that the tail of one of the 
vectors, say b, is at the head of the other. Then the vector sum a + b may be 
represented by an arrow whose tail is at the tail of a and whose head is at the head 
of b. (See Figure 1.6.) Note that it is not immediately obvious that a + b = b + a 
from this construction! 

The second way to visualize vector addition is according to the so-called 
parallelogram law: If a and b are nonparallel vectors drawn with their tails ema- 
nating from the same point, then a + b may be represented by the arrow (with its 
tail at the common initial point of a and b) that runs along a diagonal of the paral- 
lelogram determined by a and b (Figure 1.7). The parallelogram law is completely 
consistent with the head-to-tail method. To see why, just parallel translate b to the 
opposite side ofthe parallelogram. Then the diagonal just described is the result of 
adding a and (the translate of) b, using the head-to-tail method. (See Figure 1.8.) 

We still should check that these geometric constructions agree with our alge- 
braic definition. For simplicity, we’ll work in R?. Let a = (a1, a2)and b = (by, b2) 
as usual. Then the arrow obtained from the parallelogram law addition of a and 
b is the one whose tail is at the origin O and whose head is at the point P in 
Figure 1.9. If we parallel translate b so that its tail is at the head of a, then it is 
immediate that the coordinates of P must be (a; + b1, a2 + b2), as desired. 

Scalar multiplication is easier to visualize: The vector ka may be represented 
by an arrow whose length is |k| times the length of a and whose direction is the 
same as that of a when k > 0 and the opposite when k < 0. (See Figure 1.10.) 

It is now a simple matter to obtain a geometric depiction of the difference 
between two vectors. (See Figure 1.11.) The difference a — b is nothing more 


Figure 1.11 The 
geometry of vector 
subtraction. The vector c€ 
is such that b + ¢ = a. 
Hence, c = a — b. 


Figure 1.12 The displacement 
—> 


vector Pı P2, represented by the 
arrow from P to Ph, is the 
difference between the position 
vectors of these two points. 


1.1 | Vectors in Two and Three Dimensions 5 


(translated) 


Figure 1.7 The vector Figure 1.8 The equivalence of the 
a + b may be represented parallelogram law and the 
by the arrow that runs along head-to-tail methods of vector 
the diagonal of the addition. 
parallelogram determined 
by a and b. 
y 


Figure 1.9 The point P has coordinates Figure 1.10 Visualization of 
(a, + bi, a + b2). scalar multiplication. 


than a + (—b) (where —b means the scalar —1 times the vector b). The vector 

a — b may be represented by an arrow pointing from the head of b toward the 

head of a; such an arrow is also a diagonal of the parallelogram determined by a 

and b. (As we have seen, the other diagonal can be used to represent a + b.) 
Here is a construction that will be useful to us from time to time. 


DEFINITION 1.5 Given two points Pı(xı, yi, Z1) and P2(x2, yo, Z2) in R?, 
the displacement vector from P, to P is 


PP) = (%2 — X1, Y2 — yi, Z2 — Z1)- 


This construction is not hard to understand if we consider Figure 1.12. Given 
the points P; and P}, draw the corresponding position vectors O P; and O P3. 
Then we see that P; P is precisely OP,—OP,. An analogous definition may 
be made for R°. 


In your study of the calculus of one variable, you no doubt used the notions of 
derivatives and integrals to look at such physical concepts as velocity, acceleration, 
force, etc. The main drawback of the work you did was that the techniques involved 
allowed you to study only rectilinear, or straight-line, activity. Intuitively, we all 
understand that motion in the plane or in space is more complicated than straight- 
line motion. Because vectors possess direction as well as magnitude, they are 
ideally suited for two- and three-dimensional dynamical problems. 
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(a, h, a3) 


x 


Figure 1.13 After t seconds, the 
point starting at a, with velocity v, 
moves to a + fv. 


v current 
vı ship 
(with respect 
to still water) Net velocity 


Figure 1.14 The length of v; is 
15, and the length of vp is 5/2. 


> 200 Ib 


100 Ib 
200 Ib 


Figure 1.16 Vector addition in 
judo. 


For example, suppose a particle in space is at the point (a1, a2, a3) (with 
respect to some appropriate coordinate system). Then it has position vector a = 
(ai, a2, a3). If the particle travels with constant velocity v = (v1, v2, v3) for t 
seconds, then the particle’s displacement from its original position is tv, and its 
new coordinate position is a + tv. (See Figure 1.13.) 


EXAMPLE 4 Ifa spaceship is at position (100, 3, 700) and is traveling with 
velocity (7, —10, 25) (meaning that the ship travels 7 mi/sec in the positive 
x-direction, 10 mi/sec in the negative y-direction, and 25 mi/sec in the positive 
z-direction), then after 20 seconds, the ship will be at position 


(100, 3, 700) + 20(7, —10, 25) = (240, —197, 1200), 
and the displacement from the initial position is (140, —200, 500). + 


EXAMPLE 5 The S.S. Calculus is cruising due south at a rate of 15 knots 
(nautical miles per hour) with respect to still water. However, there is also a 
current of 54/2 knots southeast. What is the total velocity of the ship? If the ship 
is initially at the origin and a lobster pot is at position (20, —79), will the ship 
collide with the lobster pot? 

Since velocities are vectors, the total velocity of the ship is vı + v2, where vq is 
the velocity of the ship with respect to still water and v2 is the southeast-pointing 
velocity of the current. Figure 1.14 makes it fairly straightforward to compute 
these velocities. We have that vı = (0, —15). Since v2 points southeastward, its 
direction must be along the line y = —x. Therefore, v2 can be written as v2 = 
(v, —v), where v is a positive real number. By the Pythagorean theorem, if the 
length of vz is 5/2, then we must have v? + (—v)? = (5./2)* or 2v? = 50, so 
that v = 5. Thus, v2 = (5, —5), and, hence, the net velocity is 


After 4 hours, therefore, the ship will be at position 
(0, 0) + 4(5, —20) = (20, —80) 


and thus will miss the lobster pot. + 


EXAMPLE 6 The theory behind the venerable martial art of judo is an excel- 
lent example of vector addition. If two people, one relatively strong and the other 
relatively weak, have a shoving match, it is clear who will prevail. For example, 
someone pushing one way with 200 lb of force will certainly succeed in overpow- 
ering another pushing the opposite way with 100 lb of force. Indeed, as Figure 1.15 
shows, the net force will be 100 Ib in the direction in which the stronger person 
is pushing. 


100 Ib 200 Ib 100 Ib 


Figure 1.15 A relatively strong person pushing with a 
force of 200 Ib can quickly subdue a relatively weak one 
pushing with only 100 lb of force. 


Dr. Jigoro Kano, the founder of judo, realized (though he never expressed 
his idea in these terms) that this sort of vector addition favors the strong over the 
weak. However, if the weaker participant applies his or her 100 Ib of force in a 
direction only slightly different from that of the stronger, he or she will effect a 
vector sum of length large enough to surprise the opponent. (See Figure 1.16.) 
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This is the basis for essentially all of the throws of judo and why judo is described 
as the art of “using a person’s strength against himself or herself.’ In fact, the 
word “judo” means “the giving way.” One “gives in” to the strength of another by 
attempting only to redirect his or her force rather than to oppose it. + 
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10. 


11. 


. Sketch the following vectors in R?: 
(a) (2, 1) (b) (3, 3) (c) (1,2) 
. Sketch the following vectors in R?: 
(a) (1, 2,3) (b) (—2,0,2) (c) (2,-3,1) 


. Perform the indicated algebraic operations. Express 


your answers in the form ofa single vector a = (a4, a2) 
in R°. 

(a) 3,1) +(-1,7) 

(b) —2(8, 12) 

(c) (8,9) + 3(-1, 2) 

(d) (1, 1) + 5(2, 6) — 3(10, 2) 

(e) (8, 10) + 3((8, —2) — 2(4, 5)) 


. Perform the indicated algebraic operations. Express 


your answers in the form of a single vector a = 
(a1, a2, a3) in R°. 

(a) (2, 1,2) + (-3, 9, 7) 

(b) 3(8,4, 1) +26, -7, 7) 


(c) —2 ((2, 0, 1) — 6(5, —4, 1)) 


. Graph the vectors a = (1, 2), b = (—2, 5), and a+ 


b = (1, 2) + (-2, 5), using both the parallelogram law 
and the head-to-tail method. 


. Graph the vectors a = (3,2) and b = (—1, 1). Also 


calculate and graph a — b, 5a, and a+ 2b. 


. Let A be the point with coordinates (1, 0, 2), let B be 


the point with coordinates (—3, 3, 1), and let C be the 
point with coordinates (2, 1, 5). 


. — — 

(a) Describe the vectors AB and BA. 
> > > > 
(b) Describe the vectors AC, BC, and AC + CB. 
(c) Explain, with pictures, why AC + CB = AB. 


. Graph (1, 2, 1) and (0, —2, 3), and calculate and graph 


(1, 2, 1)+ (0, —2, 3), —1(1, 2, 1), and 4(1, 2, 1). 


. If (—12, 9, z) + (x, 7, —3) = (2, y, 5), what are x, y, 


and z? 


What is the length (magnitude) of the vector (3, 1)? 
(Hint: A diagram will help.) 


Sketch the vectors a = (1, 2) and b = (5, 10). Explain 
why a and b point in the same direction. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Sketch the vectors a=(2,—7,8) and b=(-— 1, 
i, —4). Explain why a and b point in opposite 
directions. 


How would you add the vectors (1,2,3,4) and 
(5, —1, 2, 0) in R*? What should 2(7, 6, —3, 1) be? In 
general, suppose that 

a = (a1, d2,..., An) and b = (bı, b2, ..., bn) 
are two vectors in R” and k € R is a scalar. Then how 
would you define a + b and ka? 


Find the displacement vectors from P; to P2, where P, 
and P, are the points given. Sketch Pı, P2, and P,P». 
(a) Pi(1, 0, 2), Po(2, 1, 7) 

(b) Pi(1, 6, —1), P2(0, 4, 2) 

(c) P,(0, 4, 2), Po(1, 6, —1) 

(d PiG, 1), P, —1) 


Let Pı(2, 5, —1, 6) and P2(3, 1, —2, 7) be two points 
in R*. How would you define and calculate the dis- 
placement vector from Pı to P2? (See Exercise 13.) 


If A is the point in R? with coordinates (2, 5, —6) and 
the displacement vector from A to a second point B is 
(12, —3, 7), what are the coordinates of B? 


Suppose that you and your friend are in New York talk- 
ing on cellular phones. You inform each other of your 
own displacement vectors from the Empire State Build- 
ing to your current position. Explain how you can use 
this information to determine the displacement vector 
from you to your friend. 


Give the details of the proofs of properties 2 and 3 of 
vector addition given in this section. 


Prove the properties of scalar multiplication given in 
this section. 


(a) Ifa isa vector in R? or R?, what is 0a? Prove your 
answer. 

(b) Ifa is a vector in R? or RÌ, what is 1a? Prove your 
answer. 


(a) Let a = (2, 0) and b = (1, 1). For 0 < s < 1 and 
0 <t < 1, consider the vector x = sa + tb. Ex- 
plain why the vector x lies in the parallelogram 
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22. 


23. 
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determined by a and b. (Hint: It may help to draw 
a picture.) 

(b) Now suppose that a = (2, 2, 1) and b = (0, 3, 2). 
Describe the set of vectors {x = sa + tb |0 < s < 
1,0<t<l}. 


Let a= (a1, a2, a3) and b = (b1 , b2, b3) be two nonzero 
vectors such that b Æ ka. Use vectors to describe 
the set of points inside the parallelogram with vertex 
Po(xo, yo, zo) and whose adjacent sides are parallel to 
a and b and have the same lengths as a and b. (See 
Figure 1.17.) (Hint: If P(x, y, z) is a point in the par- 
allelogram, describe OP, the position vector of P.) 


x 


Figure 1.17 Figure for Exercise 22. 


A flea falls onto marked graph paper at the point (3, 2). 
She begins moving from that point with velocity vector 
v = (—1, —2) (i.e., she moves 1 graph paper unit per 
minute in the negative x-direction and 2 graph paper 
units per minute in the negative y-direction). 

(a) What is the speed of the flea? 

(b) Where is the flea after 3 minutes? 


(c) How long does it take the flea to get to the point 
(—4, —12)? 

(d) Does the flea reach the point (— 13, —27)? Why or 
why not? 


24. 


25. 


26. 


27. 


A plane takes off from an airport with velocity vector 

(50, 100, 4). Assume that the units are miles per hour, 

that the positive x-axis points east, and that the positive 

y-axis points north. 

(a) How fast is the plane climbing vertically at take- 
off? 


(b) Suppose the airport is located at the origin and a 
skyscraper is located 5 miles east and 10 miles 
north of the airport. The skyscraper is 1,250 feet tall. 
When will the plane be directly over the building? 


(c) When the plane is over the building, how much 
vertical clearance is there? 


As mentioned in the text, physical forces (e.g., gravity) 
are quantities possessing both magnitude and direction 
and therefore can be represented by vectors. Ifan object 
has more than one force acting on it, then the resul- 
tant (or net) force can be represented by the sum of 
the individual force vectors. Suppose that two forces, 
F; = (2,7, —1) and F; = (3, —2, 5), act on an object. 
(a) What is the resultant force of F; and F2? 


(b) What force F; is needed to counteract these forces 
(i.e., so that no net force results and the object 
remains at rest)? 


A 50 lb sandbag is suspended by two ropes. Suppose 

that a three-dimensional coordinate system is intro- 

duced so that the sandbag is at the origin and the ropes 

are anchored at the points (0, —2, 1) and (0, 2, 1). 

(a) Assuming that the force due to gravity points par- 
allel to the vector (0, 0,—1), give a vector F that 
describes this gravitational force. 


(b) Now, use vectors to describe the forces along each 
of the two ropes. Use symmetry considerations and 
draw a figure of the situation. 


A 10 lb weight is suspended in equilibrium by two 
ropes. Assume that the weight is at the point (1, 2, 3) 
in a three-dimensional coordinate system, where the 
positive z-axis points straight up, perpendicular to the 
ground, and that the ropes are anchored at the points 
(3, 0,4) and (0,3,5). Give vectors F; and F, that 
describe the forces along the ropes. 


1.2 More About Vectors 


The Standard Basis Vectors 


In R’, the vectors i = (1, 0) and j = (0, 1) play a special notational role. Any 
vector a = (a1, a2) may be written in terms of i and j via vector addition and 


scalar multiplication: 


(41, a2) = (a1, 0) + (0, a2) = a1, 0) + a2(0, 1) = ar it aj. 


(It may be easier to follow this argument by reading it in reverse.) Insofar as nota- 
tion goes, the preceding work simply establishes that one can write either (a1, a2) 


Figure 1.20 In R?, the equation 
y = 3 describes a line. 


Z 


Figure 1.21 In R, the equation 
y = 3 describes a plane. 


1.2 | More About Vectors 9 


y 


Senide a=ait+ aj 


mj 


Figure 1.18 Any vector in R? can be written in terms ofi and j. 


Z 


x 


Figure 1.19 Any vector in R? can be written in terms of i, j, and k. 


or aji + azj to denote the vector a. It’s your choice which notation to use (as long 
as you’re consistent), but the ij-notation is generally useful for emphasizing the 
“vector” nature of a, while the coordinate notation is more useful for emphasizing 
the “point” nature of a (in the sense of a’s role as a possible position vector of 
a point). Geometrically, the significance of the standard basis vectors i and j is 
that an arbitrary vector a € R? can be decomposed pictorially into appropriate 
vector components along the x- and y-axes, as shown in Figure 1.18. 

Exactly the same situation occurs in R*, except that we need three vec- 
tors, i = (1, 0, 0), j = (0, 1, 0), and k = (0, 0, 1), to form the standard basis. (See 
Figure 1.19.) The same argument as the one just given can be used to show that 
any vector a = (a1, a2, a3) may also be written as a; i+ a2 j + a3 k. We shall 
use both coordinate and standard basis notation throughout this text. 


EXAMPLE 1 We may write the vector (1,—2) as i— 2j and the vector 
(7, x, —3) as 7i+ mj — 3k. + 


Parametric Equations of Lines - 


In R?, we know that equations of the form y = mx + b or Ax + By = C describe 
straight lines. (See Figure 1.20.) Consequently, one might expect the same sort of 
equation to define a line in R? as well. Consideration of a simple example or two 
(such as in Figure 1.21) should convince you that a single such linear equation 
describes a plane, not a line. A pair of simultaneous equations in x, y, and z is 
required to define a line. 

We postpone discussing the derivation of equations for planes until §1.5 and 
concentrate here on using vectors to give sets of parametric equations for lines in 
R? or R? (or even R”). 
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y 


t=2/2 


t = 37/2 


Figure 1.22 The graph of the 
parametric equations x = 2 cost, 
y=2sint,0<t < 2r. 


x 


Figure 1.23 The line / is the 
unique line passing through Po and 
parallel to the vector a. 


Pye z 


x 


Figure 1.24 The graph of a line 
in R°. 


First, we remark that a curve in the plane may be described analytically 
by points (x, y), where x and y are given as functions of a third variable (the 
parameter) t. These functions give rise to parametric equations for the curve: 


x= f(t) 
y= g(t) © 
EXAMPLE 2 The set of equations 
o 0<t<2r 
y = 2sint 


describes a circle of radius 2, since we may check that 
x? + y? = (2 cost} + (2sint) = 4. 
(See Figure 1.22.) + 
Parametric equations may be used as readily to describe curves in R?; a curve 


in R? is the set of points (x, y, z) whose coordinates x, y, and z are each given by 
a function of t: 


x= f(t) 
y= g(t). 
z= ht) 


The advantages of using parametric equations are twofold. First, they offer a 
uniform way of describing curves in any number of dimensions. (How would 
you define parametric equations for a curve in R4? In R!?8?) Second, they allow 
you to get a dynamic sense of a curve if you consider the parameter variable ¢ to 
represent time and imagine that a particle is traveling along the curve with time 
according to the given parametric equations. You can represent this geometrically 
by assigning a “direction” to the curve to signify increasing t. Notice the arrow 
in Figure 1.22. 

Now, we see how to provide equations for lines. First, convince yourself that 
a line in R? or R? is uniquely determined by two pieces of geometric information: 
(1) a vector whose direction is parallel to that of the line and (2) any particular 
point lying on the line—see Figure 1.23. In Figure 1.24, we seek the vector 


r= OÈ 


between the origin O and an arbitrary point P on the line / (i.e., the position 
vector of P(x, y, z)). OP is the vector sum of the position vector b of the given 


point Po (i.e., O Po) and a vector parallel to a. Any vector parallel to a must be a 
scalar multiple of a. Letting this scalar be the parameter variable t, we have 


r= OP = OP, + ta, 


and we have established the following proposition: 


PROPOSITION 2.1 The vector parametric equation for the line through the point 
Po(bı, b2, b3), whose position vector is O Py = b = bii + b2j + b3k, and parallel 
to a = aji + aj + a3k is 


r(t) = b + ta. (1) 


x 


Figure 1.25 Finding equations 
for a line through two points in 
Example 4. 
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Expanding formula (1), 


r(t) = OP = bii + bj + bsk + t(aii + aj + ask) 


= (aıt + by it (ant + b2)j + (azt + b3)k. 


Next, write OP as xi + yj + zk so that P has coordinates (x, y, z). Then, ex- 
tracting components, we see that the coordinates of P are (aıt +b), azt + bo, 
a3t + b3) and our parametric equations are 


x=at+bd, 
y=at+b, (2) 
z = mt + b; 


where ¢ is any real number. 

These parametric equations work just as well in R? (if we ignore the z- 
component) or in R” where n is arbitrary. In R”, formula (1) remains valid, where 
we take a = (a1, d2,..., an) and b = (bj, b2, ..., bn). The resulting parametric 
equations are 


xı =at+b, 
x2 = mat + b2 


Xn = Ant + bn 


EXAMPLE 3 Tofindthe parametric equations ofthe line through (1, —2, 3) and 
parallel to the vector zi — 3j + k, we have a = zi — 3j + k and b = i — 2j + 3k 
so that formula (1) yields 


r(t) = i — 2j + 3k + t(ri — 3j + k) 
= (1 + at)ji+ (-2 — 3tj + 3 + t)k. 


The parametric equations may be read as 


x=xnt+1 
y=-3t-2. 
z=t+3 + 


EXAMPLE 4 From Euclidean geometry, two distinct points determine a unique 
line in R? or R°. Let’s find the parametric equations of the line through the points 
Po(1, —2, 3) and P;(0, 5, —1). The situation is suggested by Figure 1.25. To use 
formula (1), we need to find a vector a parallel to the desired line. The vector with 
tail at Po and head at P; is such a vector. That is, we may use for a the vector 


P,P, = (0 — 1, 5 — (—2), —1 — 3) = —i + 7j — 4k. 


For b, the position vector of a particular point on the line, we have the choice 
of taking either b = i — 2j + 3k or b = 5j — k. Hence, the equations in (2) yield 
parametric equations 


x=l1-t x=-t 
y=-2+7t or y=54+7t 
z=3-4t z =—l—4t + 
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In general, given two arbitrary points 
Po(a1, a2,a3) and Pi(bi, b2, b3), 
the line joining them has vector parametric equation 
r(t) = OP) +1 PoP. (3) 
Equation (3) gives parametric equations 


x =a, + (bı — a; )t 
y =m + (b2 — a )t . (4) 
Z=aat (b3 = a3)t 


Alternatively, in place of equation (3), we could use the vector equation 


r(t) = OP, +1 Poh, (5) 
or perhaps 
r(t) = OP, + 1P, P), (6) 


each of which gives rise to somewhat different sets of parametric equations. Again, 
we refer you to Figure 1.25 for an understanding of the vector geometry involved. 

Example 4 brings up an important point, namely, that parametric equations 
for a line (or, more generally, for any curve) are never unique. In fact, the two 
sets of equations calculated in Example 4 are by no means the only ones; we 
could have taken a = P P, =i— 7j + 4k or any nonzero scalar multiple of 
PoP for a. 

If parametric equations are not determined uniquely, then how can you check 
your work? In general, this is not so easy to do, but in the case of lines, there are 
two approaches to take. One is to produce two points that lie on the line specified 
by the first set of parametric equations and see that these points lie on the line 
given by the second set of parametric equations. The other approach is to use the 
parametric equations to find what is called the symmetric form of a line in R°. 
From the equations in (2), assuming that each a; is nonzero, one can eliminate 
the parameter variable t in each equation to obtain: 


x — bd, 
= 


ai 


The symmetric form is 
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In Example 4, the two sets of parametric equations give rise to corresponding 
symmetric forms 
x-1l yt2 27-3 x y-5 zt+l 
= = and = = . 
—1 7 —4 —1 7 —4 

It’s not difficult to see that adding 1 to each “side” of the second symmetric form 
yields the first one. In general, symmetric forms for lines can differ only by a 
constant term or constant scalar multiples (or both). 

The symmetric form is really a set of two simultaneous equations in RÌ. For 
example, the information in (7) can also be written as 


x—b, y-b 


a a2 


x — bd, _ k= by ` 


ai a3 


This illustrates that we require two “scalar” equations in x, y, and z to describe a 
line in R°, although a single vector parametric equation, formula (1), is sufficient. 

The next two examples illustrate how to use parametric equations for lines to 
identify the intersection of a line and a plane or of two lines. 


EXAMPLE 5 We find where the line with parametric equations 


x=t+5 
y=—2t—4 
z=3t+7 


intersects the plane 3x + 2y — 7z = 2. 

To locate the point of intersection, we must find what value of the parameter t 
gives a point on the line that also lies in the plane. This is readily accomplished by 
substituting the parametric values for x, y, and z from the line into the equation 
for the plane 


3+5) +2 Sr ee (8) 


Solving equation (8) for t, we find that t = —2. Setting t equal to —2 in the 
parametric equations for the line yields the point (3, 0, 1), which, indeed, lies in 
the plane as well. + 


EXAMPLE 6 We determine whether and where the two lines 


x=t+l1 x=3t-—3 
y=5t+6 and y=t 
= —2t z=tt+1 


intersect. 

The lines intersect provided that there is a specific value tı for the parameter 
of the first line and a value t for the parameter of the second line that generate the 
same point. In other words, we must be able to find t; and t so that, by equating 
the respective parametric expressions for x, y, and z, we have 


th+1 =3h-3 
5 +6=h . (9) 
—2t =h+1 
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A 


O 


Figure 1.27 The result of the 
wheel in Figure 1.26 rolling 
through a central angle of t. 


(a 
YY 


Figure 1.28 AP with its tail at 
the origin. 


The last two equations of (9) yield 
t= 5t)+6=-24,-1 > t=-1. 


Using t; = —1 in the second equation of (9), we find that t = 1. Note that the 
values t; = —1 and f = 1 also satisfy the first equation of (9); therefore, we have 
solved the system. Setting t = —1 in the set of parametric equations for the first 
line gives the desired intersection point, namely, (0, 1, 2). + 


Parametric Equations in General 


Vector geometry makes it relatively easy to find parametric equations for a variety 
of curves. We provide two examples. 


EXAMPLE 7 Ifa wheel rolls along a flat surface without slipping, a point on 
the rim of the wheel traces a curve called a cycloid, as shown in Figure 1.26. 


y 


Figure 1.26 The graph of a cycloid. 


Suppose that the wheel has radius a and that coordinates in R? are chosen so that 
the point of interest on the wheel is initially at the origin. After the wheel has 
rolled through a central angle oft radians, the situation is as shown in Figure 1.27. 


We seek the vector O È, the position vector of P, in terms of the parameter t. 
: > > . f 
Evidently, OP = OA + AP, where the point A is the center of the wheel. The 


vector O A is not difficult to determine. Its j-component must be a, since the center 
of the wheel does not vary vertically. Its i-component must equal the distance the 
wheel has rolled; if £ is measured in radians, then this distance is at, the length 


of the arc of the circle having central angle t. Hence, OA =ati+ aj. 
¿ —_ 
The value of vector methods becomes apparent when we determine AP. 


Parallel translate the picture so that AP has its tail at the origin, as in Figure 1.28. 
From the parametric equations of a circle of radius a, 


3 . . (30 . ek ; 
AP = acos 5 t | i+ asn 7 t |j = —asınti-— a costj, 


from the addition formulas for sine and cosine. We conclude that 


OP = OÀ + AP = (ati + aj) + (—a sinti — a cost j) 


= a(t — sin t)i + a(l — cos t)j, 


Generating 
circle 


Figure 1.31 The involute. 
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so the parametric equations are 
x = a(t — sint) 


y =a(l — cost) | + 


EXAMPLE 8 If you unwind adhesive tape from a nonrotating circular tape 
dispenser so that the unwound tape is held taut and tangent to the dispenser roll, 
then the end of the tape traces a curve called the involute of the circle. Let’s 
find the parametric equations for this curve, assuming that the dispensing roll 
has constant radius a and is centered at the origin. (As more and more tape is 
unwound, the radius of the roll will, of course, decrease. We’ll assume that little 
enough tape is unwound so that the radius of the roll remains constant.) 


Considering Figure 1.29, we see that the position vector OÈ of the desired 
point P is the vector sum OB + BP. To determine OB and BP, we use the angle 
— 
0 between the positive x-axis and O B as our parameter. Since B is a point on the 
circle, 


OB = acosĝi+asinðj. 


y y 
Unwound 
tape 
B 
a 
Involute zZ 
x 
ag 0- l2 
x 
oes — 
Figure 1.29 Unwinding tape, as Figure 1.30 The vector B P must 
in Example 8. The point P make an angle of 0 — 2/2 with the 


describes a curve known as the positive x-axis. 
involute of the circle. 


To find the vector BÈ, parallel translate it so that its tail is at the origin. Figure 1.30 


shows that BÈ ’s length must be a0, the amount of unwound tape, and its direction 
must be such that it makes an angle of 6 — 2/2 with the positive x-axis. From our 
experience with circular geometry and, perhaps, polar coordinates, we see that 


BP is described by 
N TNs P T\. : A . 
BP = ab cos (6 — S) i+ að sin (0 — =)i = af sinf i — a0 cos 0 j. 


Hence, 


OP = OŘ + BP = a(cos0 + 0 sin0)i + a(sin0 — 0 cos 0) j. 
So 


x = a(cos@ + 0 sin) 
y = a(sin 0 — 6 cos 0) 


are the parametric equations of the involute, whose graph is pictured in 
Figure 1.31. + 
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1.2 Exercises 


In Exercises 1—5, write the given vector by using the standard 
basis vectors for R? and R°. 


1. 
4. 


(2, 4) 2. (9, —6) 


5. (2,4, 0) 


3. (3,7, —7) 
(—1, 2,5) 


In Exercises 6—10, write the given vector without using the 
standard basis notation. 


6. 
7. 


12. 


i+j—3k 
9i — 2j + V2k 


. —3(2i — 7k) 
. mi — j (Consider this to be a vector in R?.) 
. i — j (Consider this to be a vector in R?.) 


. Leta; = (1, 1) and az = (1, —1). 


(a) Write the vector b = (3, 1) as c1aı + c2a2, where 
cı and c2 are appropriate scalars. 

(b) Repeat part (a) for the vector b = (3, —5). 

(c) Show that any vector b = (bı, b2) in R? may be 
written in the form ca + c2a2 for appropriate 
choices of the scalars c1, c2. (This shows that a; 
and a» form a basis for R? that can be used instead 
of i and j.) 

Let aj =(1, 0, —1), a2 = (0, 1, 0), and a3 = (1, 1, —1). 

(a) Find scalars c1, c2, c3, so as to write the vector 
b = (5, 6, —5) as c,a; + Ca) + €343. 

(b) Try to repeat part (a) for the vector b = (2, 3, 4). 
What happens? 

(c) Can the vectors a), a2, a3 be used as a basis for 
R?, instead of i, j, k? Why or why not? 


In Exercises 13-18, give a set of parametric equations for the 
lines so described. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 


The line in R? through the point (2, —1, 5) that is par- 
allel to the vector i+ 3j — 6k. 


The line in R? through the point (12, —2, 0) that is 
parallel to the vector Si — 12j + k. 


The line in R? through the point (2, —1) that is parallel 
to the vector i — 7j. 

The line in R? through the points (2,1,2) and 
Gy —1, 5): 


The line in R? through the points (1,4,5) and 
(2,4, —1). 


The line in R? through the points (8, 5) and (1, 7). 


Write a set of parametric equations for the line in R* 
through the point (1, 2, 0, 4) and parallel to the vector 
(—2,.5,.3;7): 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


Write a set of parametric equations for the line in 
R through the points (9,7z,—1,5,2) and (—1, 1, 
2,7, 1). 


(a) Write a set of parametric equations for the line in 
R? through the point (—1, 7, 3) and parallel to the 
vector 2i — j + 5k. 


(b) Write a set of parametric equations for the line 
through the points (5, —3, 4) and (0, 1, 9). 


(c) Write different (but equally correct) sets of equa- 
tions for parts (a) and (b). 
(d) Find the symmetric forms of your answers in 
(a)(c). 
Give a symmetric form for the line having parametric 
equations x = 5 — 2t, y=3r+1,z=6t—4. 
Give a symmetric form for the line having parametric 
equations x = t + 7, y = 3t — 9,z = 6 — ŝt. 
A certain line in R? has symmetric form 
x-2 y=3 ztl 
5 -2 «4 
Write a set of parametric equations for this line. 


Give a set of parametric equations for the line with 
symmetric form 
x+5 a 
300 TD 


Are the two lines with symmetric forms 


x—1  yt2  z+1 


5 —3 4 
and 
x-4 y-1 z+5 
iG =<" 8 


the same? Why or why not? 

Show that the two sets of equations 

x-2 y-l z x+1 y+6 z+5 
= = and = = 

3 7 5 6 14 10 


actually represent the same line in R°. 


Determine whether the two lines /; and l defined by 
the sets of parametric equations lı: x = 2t — 5, y= 
3t+2,z=1 —-6t, and lg: x = 1 — 2t, y= 11 — 3, 
z = 6t — 17 are the same. (Hint: First find two points 
on l; and then see if those points lie on /.) 


Do the parametric equations l: x =3t +2, y= 
t—7, z=St+1, and h: x =6t—-1, y=2t—-8, 
z = 10t — 3 describe the same line? Why or why not? 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


Do the parametric equations x = 347 +7, y = 2 — t°, 
z = 5t? + 1 determine a line? Why or why not? 


Do the parametric equations x = 517 — 1, y = 2t? + 
3,z = 1 — £? determine a line? Explain. 

A bird is flying along the straight-line path x = 2t + 7, 
y = t —2,z = 1 — 3t, where t is measured in minutes. 


(a) Where is the bird initially (at t = 0)? Where is the 
bird 3 minutes later? 


(b) Give a vector that is parallel to the bird’s path. 
(c) When does the bird reach the point (#, 4,—44)? 
(d) Does the bird reach (17, 4, —14)? 


Find where the line x = 3t — 5, y = 2 — t, z = 6t in- 
tersects the plane x + 3y — z = 19. 
Where does the line x = 1 — 4t, y = t — 3/2, z= 


2t + 1 intersect the plane 5x — 2y +z = 1? 


Find the points of intersection of the line x = 2t — 3, 
y =3t +2, z= 5-— t with each of the coordinate 
planes x = 0, y = 0, and z = 0. 


Show that the line x = 5 — t, y = 2t — 7,z = t — 3 is 
contained in the plane having equation 2x — y + 4z = 5. 


Does the line x = 5 — t, y = 2t — 3,z = 7t + 1 inter- 
sect the plane x — 3y + z = 1? Why? 


Find where the line having symmetric form 


x-3 yt2 z 
6 3 5 
intersects the plane with equation 2x — 5y + 3z + 8 = 0. 


Show that the line with symmetric form 


=3 z+2 
* =y-5= 


—2 i 3 


lies entirely in the plane 3x + 3y + z = 22. 
Does the line with symmetric form 


x+4 y-2 z-i 
3 -l -9 


intersect the plane 2x — 3y + z = 7? 


Leta, b, c be nonzero constants. Show that the line with 
parametric equations x =at +a, y =b, z=ct+c 
lies on the surface with equation x?/a? + y?/b* — 
2/e = 1. 


Find the point of intersection of the two lines l1: x = 
2t+3, y=3t4+3, z= 2t4+1 and h:x = 15—-Tt, 
y=t—2,z=3t—7. 


Do the lines h:x =2t+1, y=-—3t, z=t-1 
and h:x =3t+1, y=t+5, z=7-—t intersect? 
Explain your answer. 


44. 


45. 


46. 
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(a) Find the distance from the point (—2, 1, 5) to any 
point on the line x =3t — 5, y = 1 — t, z= 4t + 7. 
(Your answer should be in terms of the param- 
eter t.) 


(b) Now find the distance between the point (—2, 1, 5) 
and the line x = 3t — 5, y = 1 — t, z = 4t +7. 
(The distance between a point and a line is the dis- 
tance between the given point and the closest point 
on the line.) 


(a) Describe the curve given parametrically by 


x = 2 cos3t 27 
0< 
y =2sin3t 


What happens if we allow t to vary between 0 
and 27? 


(b) Describe the curve given parametrically by 


x = 5cos3t 27 
: 0< 
y = 5sin3t 


(c) Describe the curve given parametrically by 


x = 5sin3t 27 
0< 
y = 5cos3t 


(d) Describe the curve given parametrically by 


x = 5cos3t 27 
. 0< 
y =3sin3t 


Suppose that a bicycle wheel of radius a rolls along a 
flat surface without slipping. If a reflector is attached 
to a spoke of the wheel at a distance b from the center, 
the resulting curve traced by the reflector is called a 
curtate cycloid. One such cycloid appears in Fig- 
ure 1.32, where a = 3 and b = 2. 


X 


} t x 


27 4r 


Figure 1.32 A curtate cycloid. 


Using vector methods or otherwise, find a set of para- 
metric equations for the curtate cycloid. Figure 1.33 
should help. (Take a low point of the cycloid to lie 
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Figure 1.34 A prolate cycloid. 


y 
47. Egbert is unwinding tape from a circular dispenser of 
radius a by holding the tape taut and perpendicular to 
Z DAN the dispenser. Find a set of parametric equations for 
A the path traced by the end of the tape (the point P in 
á AL Figure 1.35) as Egbert unwinds the tape. Use the angle 
P 0 between OP and the positive x-axis for parameter. 
O Assume that little enough tape is unwound so that the 
radius of the dispenser remains constant. 


Figure 1.33 The point P traces a 
curtate cycloid. 


on the y-axis.) There is no theoretical reason that the 
cycloid just described cannot have a < b, although in 
such case the bicycle-wheel-reflector application is no 
longer relevant. (When a < b, the parametrized curve 
that results is called a prolate cycloid.) Your paramet- 
ric equations should be such that the constants a and b 
can be chosen independently of one another. An exam- 
ple ofa prolate cycloid, witha = 2 andb = 4, is shown 
in Figure 1.34. Try to think of a physical situation in 
which such a curve would arise. 


Figure 1.35 Figure for Exercise 47. 


1.3 The Dot Product 


When we introduced the arithmetic notions of vector addition and scalar mul- 
tiplication, you may well have wondered why the product of two vectors was 
not defined. You might think that “vector multiplication” should be defined in a 
manner analogous to the way we defined vector addition (1.e., by componentwise 
multiplication). However, such a definition is not very useful. Instead, we shall 
define and use two different concepts of a product of two vectors: (1) the Eu- 
clidean inner product, or “dot” product, which may be defined for two vectors in 
R” (where n is arbitrary) and (2) the “cross” or vector product, which is defined 
only for vectors in R. 
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The Dot Product of Two Vectors 


DEFINITION 3.1 Let a = (a), a2, a3) and b = (bı, b2, b3) be two vectors 
in R°. The dot (or inner or scalar) product of a and b, denoted a- b, is 


a-b = abı + ab + a3b3. 
In R?, the analogous definition is 
a-b= a,b, a abs, 
where a = (a1, dz) and b = (bj, bo). 


EXAMPLE 1 In R?, we have 


(1, -2, 5) (2, 1, 3) = (DB) + (-2)(1) + (5)(3) = 15. 
Gi + 2j — k)- (i — 2k) = 8X1) + (2X0) + (-1)(-2) = 5. Ps 


In accordance with its name, the dot—or scalar—product takes two vectors 
and produces a single real number (not a vector). 
The following facts are consequences of Definition 3.1: 


Properties of dot products. Ifa, b, and c are any vectors in R? (or R?) and 
k € R is any scalar, then 

1. a-a > 0, anda-a = 0 ifand only if a = 0; 

2. a-b=b-a; 

3. a-(b+c)=a-b+a-c; 

4. (ka)-b =k(a-b) = a- (kb). 


Proof of Property 1 Ifa = (a1, az, a3), then we have 
a.a = adıdı + a242 + 4343 = +a +a. 


This last expression is evidently nonnegative, since it is a sum of squares of real 
numbers. Moreover, such an expression is zero exactly when each of the terms is 
zero, that is, if and only if aj = az = a3 = 0. E 


We leave the proofs of properties 2, 3, and 4 as exercises. 
Thus far, we have introduced the dot product of two vectors as a purely 


algebraic construction. It is the geometric interpretation of the definition that is 
really interesting. To establish this interpretation, we begin with the following: 


DEFINITION 3.2 If a= (a), a, a3), then the length of a (also called the 
norm or magnitude), denoted ||all, is /a? + a3 + a3. 
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a 


Figure 1.36 The dot 
product of a and b is 
llall || bl] cos 6. 


Figure 1.37 The vector 
triangle used in the proof 
of Theorem 3.3. 


The motivation for this definition is evident if we draw a as the position vector of 
the point (a1, a2, a3). Then the length of the arrow from the origin to (a1, a2, a3) is 


Vai — 0% + (a — 0} + (a — 0}, 


as given by the distance formula, which is nothing more than an extension of the 
Pythagorean theorem in the plane. As we just saw, a- a = a? + a} + a3, and we 
have 


llall = va- a 


or, equivalently, 


2 
a.a = |lall~. 


Now we’re ready to state the main result concerning the geometry of the dot 
product. If a and b are two nonzero vectors in R? (or R?) drawn with their tails at 
the same point, let 0, where 0 < 6 < x, be the angle between a and b. If either a 
or b is the zero vector, then 6 is indeterminate (i.e., can be any angle). 


THEOREM 3.3 Ifaand b are any two vectors in either R? or R3, then 
a-b = |lal] ||bl| cos 6. 
(See Figure 1.36.) 


PROOF If either a or b is the zero vector, say a, then a = (0, 0, 0) and so 
a+b = (0)(b1) + (0)(b2) + (0)(b3) = 0. 
Also, ||a|] = 0 in this case, so the formula in Theorem 3.3 holds. In this case, the 
angle @ is indeterminate. 
Now suppose that neither a nor b is the zero vector. Let c = b — a. Then 
we may apply the law of cosines to the triangle whose sides are a, b, and c 
(Figure 1.37) to obtain 


lell? = llall? + Ibl? — 2llall ||bl| cos. 
Thus, 
2ljall bl] cos = llall? + llb]? — le? =a-a+b-b—c-e, (2) 
from equation (1). Now, use the properties of the dot product. Since ec = b — a, 
c-c = (b — a). (b — a) 
= (b— a)-b — (b — a). a 
=þb-b—a-b-—b-.-a+a:a, (3) 


by properties 3 and 4 of the dot product. If we use equation (3) to substitute for 
c+ c in equation (2), then 


2|lal| ||b|| cos = a-a+b-b—(b-b—a-b—b-a+a-a) 
=a-b+b-a 
= 2a-b, 


by property 2 of the dot product. By canceling the factor of 2 on both sides, the 
desired result is obtained. a 


2kg 


Figure 1.38 An object 
sliding down a ramp. 
The force due to gravity 
is downward, but the 
direction of travel of the 
object is inclined 30° to 
the horizontal. 
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Angles Between Vectors 


Theorem 3.3 may be used to find the angle between two nonzero vectors a and 
b—just solve for 8 in the formula in Theorem 3.3 to obtain 


_; arb 


llall {bl 


The use of the inverse cosine is unambiguous, since we take 0 < 0 < a when 
defining angles between vectors. 


EXAMPLE 2 Ifa = i+ jand b = j — k, then formula (4) gives 


p eo) a h = cos™! = = cos™! E == + 
li + jll Ilj — il (V2 + /2) 2 3 


If a and b are nonzero, then Theorem 3.3 implies 


0 = cos 


cos =0 ifandonlyif a-b = 0. 


We have cos 0 = 0 just in case 0 = x /2. (Remember our restriction on 6.) Hence, 
it makes sense for us to call a and b perpendicular (or orthogonal) when a- b = 
0. If either a or b is the zero vector, then we cannot use formula (4), and the angle 
0 is undefined. Nonetheless, since a: b = 0 if a or b is 0, we adopt the standard 
convention and say that the zero vector is perpendicular to every vector. 


EXAMPLE 3 The vector i+ j is orthogonal to the vector i — j + k, since 
ü+) G-j+ k =()0)+ C1) + 0) = 0. + 


Vector Projections 


Suppose that a 2 kg object is sliding down a ramp having a 30° incline with the 
horizontal as in Figure 1.38. If we neglect friction, the only force acting on the 
object is gravity. What is the component of the gravitational force in the direction 
of motion of the object? 

To answer questions of this nature, we need to find the projection of one vector 
on another. The general idea is as follows: Given two nonzero vectors a and b, 
imagine dropping a perpendicular line from the head of b to the line through a. 
Then the projection of b onto a, denoted proj, b, is the vector represented by the 
arrow in Figure 1.39. 


proj,b proj,b 


Figure 1.39 Projection of the vector b onto the vector a. 
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Given this intuitive understanding of the projection, we find a precise formula 
for it. Recall that a vector is determined by magnitude (length) and direction. It 
follows by definition that the direction of proj,b is either the same as that of a, 
or opposite to a if the angle 0 between a and b is more than 2/2. Trigonometry 
then tells us 


| projab | 
Ib 


(The absolute value sign around cos @ is needed in case 17/2 < 0 < x.) Hence, 
with a bit of algebra, we have 


|cos ð| = 


lall [|bl| |cos@| _ [a-b] 


|projab|| = Ilbl] |cos 8| jaj Jaj 


by Theorem 3.3. Thus, we know the magnitude and direction of proj, b. To obtain 
a compact formula for proj, b, note the following: 


PROPOSITION 3.4 Let k be any scalar and a any vector. Then 


1. [kal] = |k] llall. 
2. A unit vector (i.e., a vector of length 1) in the direction of a nonzero vector 
a is given by a/|lal]. 


PROOF Part 1 is left as an exercise. (Write out ka and ||ka|| in terms of compo- 
nents.) For part 2, we must check that the length of a/|lal| is 1: 


a 1 1 
= |— al = — lal =1, 
llall llall llall 
by part 1 (since 1/||aļ| is a positive scalar). E 


Now proj,b is a vector of length |a- b|/||a]| in the “+a-direction.” That is, 


. ja- b| a llall |lb]| |cos@| a 
projab = + x — = ; 
llall llall llall llall 
length of _ unit vector 
projab in direction of a 


Note that the angle 6 keeps track of the appropriate sign of proj, b; that is, when 
0 <6 < 2/2, cos@ is positive and proj, b points in the direction of a, and when 
m/2 <0 <7, cos6@ is negative and proj, b points in the direction opposite to that 
of a. Thus, we can eliminate both the + sign and the absolute value, and we find 
that 


; llall ||bl| cos@ a a-b 
rojab = 


a 


= a 
lali lall fall? 


by Theorem 3.3, so that 


by equation (1). Formula (5) is concise and not difficult to remember. 


2kg 


30° 


Figure 1.40 The 2 kg 
object sliding down a 
ramp in Example 4. 
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EXAMPLE 4 To answer the question posed at the beginning of this subsection, 
we need to calculate proj, F, where F is the gravitational force vector and a points 
along the ramp as shown in Figure 1.40. We have a coordinate situation as shown 
in Figure 1.41. From trigonometric considerations, we must have a = aji + aj 
such that a; = —|lal| cos 30° and a = —|la|| sin30°. Since we are really only 
interested in the direction of a, there is no loss in assuming that a is a unit vector. 
Thus, 


a = — cos 30° i — sin30° j = -F i- 5j. 


F = -mgj =-19.6j 


Figure 1.41 The vectors a and F in Example 4, 
realized in a coordinate system. 


Taking g = 9.8 m/sec?, we have F = —2gj = —19.6j. Therefore, formula (5) 
implies 


V3 l o . 
, a- F (—-Gi-5j)-(-196j) / V3, 1 
F = — = i j 
PrOJa & Ja i 2 93 

ve 1 

=9.8|-—i--j 

( 2° oA 


x —8.49i — 4.9], 
and the component of F in this direction is 
|proj, F || = ||—8.49i — 4.9 j|| = 9.8 N. + 


Unit vectors—that is, vectors of length 1—are important in that they capture 
the idea of direction (since they all have the same length). Part 2 of Proposition 
3.4 shows that every nonzero vector a can have its length adjusted to give a unit 
vector u = aą/||a|| that points in the same direction as a. This operation is referred 
to as normalization of the vector a. 


EXAMPLE 5 A fluid is flowing across a plane surface with uniform velocity 
vector v. If n is a unit vector perpendicular to the plane surface, let’s find (in terms 
of v and n) the volume of the fluid that passes through a unit area of the plane in 
unit time. (See Figure 1.42.) 
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ae | 
D E 


Figure 1.42 Fluid flowing across Figure 1.43 After one unit of time, the fluid passing 
a plane surface. across a square will have filled the box. 


First, imagine one unit of time has elapsed. Then over a unit area of the plane 
(say over a unit square), the fluid will have filled a “box” as in Figure 1.43. The 
box may be represented by a parallelepiped (a three-dimensional analogue of a 
parallelogram). The volume we seek is the volume of this parallelepiped and is 


Volume = (area of base)(height). 


The area of the base is 1 unit by construction. The height is given by ||proj,,v|l. 
From formula (5), 


f n-v 
proja Y = (—) n = (n-v)n, 
nen 
since n -n = ||n||? = 1. Hence, 
||proj,v|] = I@ -vall = |n - v| [In|] = [n +v], 


by part 1 of Proposition 3.4. + 


Vector Proofs 


We conclude this section with two illustrations of how wonderfully well vectors 
are suited to providing elegant proofs of geometric results. 


EXAMPLE 6 In an arbitrary triangle, show that the line segment joining the 
midpoints of two sides is parallel to and has half the length of the third side. (See 
Figure 1.44.) In other words, if Mı is the midpoint of side AB and M; is the 
midpoint of side AC, we wish to show that M, M; is parallel to BC and has half 


its length. 
A A 
M, M, M; M, 
B C B C 
Figure 1.44 In triangle ABC, Figure 1.45 The vector version 
M, Mj is parallel to BC and has of triangle ABC in Example 6. 
half its length. 


For a vector proof, we use the diagram in Figure 1.45, a slightly modified ver- 
sion of Figure 1.44. The midpoint conditions translate to the following statements 


about vectors: 
AM, =14B, AM, = 1AC. 
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Now, 


MM, = AM, — AM, = 1AC — LAB = 1(AC — AB) = 1 BC. 


But Mı M> = LBC is precisely what we wish to prove: To say Mı Mo is a scalar 


. — 
times BC means that the two vectors are parallel. Moreover, from part 1 of 
Proposition 3.4, 


—> —> 
IMMI = 15 BCI] = }IBČI, 
so that the length condition also holds. + 


EXAMPLE 7 Show that every angle inscribed in a semicircle is a right angle, 
as suggested by Figure 1.46. 


N 7 
N Z 
a Pig 
Figure 1.46 Every angle Figure 1.47 a and b are “radius 
inscribed in a semicircle is a vectors.” 
right angle. 


To prove this remark, we’ll make use of Figure 1.47, where a and b are “radius 
vectors” with tails at the center of the circle. We need only show that a — b (a 
vector along one ray ofthe angle in question) is perpendicular to —a — b (a vector 
along the other ray). In other words, we wish to show that 


(a — b) - (~a — b) = 0. 


We have 
(a — b)-(—a— b) = (—1)(a — b) » (a + b), 


by property 4 of dot products, 
= (—!1)((a — b) - a + (a — b) - b) 
=(-l)(a-a—b-a+a-b—b-b) 
= (—1)(llall? — [1b1)”), 


by properties 2 and 4, 
=Ó; 


since both a and b are radius vectors (and therefore have the same length, namely, 
the radius of the circle). + 


Vector proofs as in Examples 6 and 7 are elegant and sometimes allow you to 
write shorter and more direct proofs than those from your high school geometry 
days. 
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Exercises 


Compute a » b, ||all, ||b|| for the vectors listed in Exercises 1—6. 
1. a= (1, 5), b = (—2, 3) 
2. a= (4, —1), b = (5,2) 
3. a= (-1,0, 7), b = (2, 4, —6) 
4. a=(2,1,0), b = (1, —2, 3) 
5. a=4i—3j+k,b=i+j+k 
6. a=i+2j—k,b = —3j + 2k 


In Exercises 7—11, find the angle between each of the pairs of 


vectors. 
7.a=VJ3i+j,b=—-V3i4+j 
8. a= (—1, 2), b = (3, 1) 
9.a=i+j,b=i+jt+k 
10. a=i+j—k,b=—i+ 2j + 2k 
11. a= (1, —2, 3), b = (3, —6, —5) 


In Exercises 12—16, calculate proj,b. 


12. 
13. 
14. 


15 


16. 
17. 


18. 


19. 


20. 


21. 


22. 
23. 
24. 


a=i+j, b=2i+3j—k 
a= (i+p/V2, b=2i+ 3j-k 
a = 5k,b = i — j+ 2k 

. a= —3k,b=i— j+ 2k 
a=i+j+2k, b=2i—4j+k 


Give a unit vector that points in the same direction as 
the vector 2i — j +k. 


Give a unit vector that points in the direction opposite 
to the vector —i + 2k. 


Give a vector of length 3 that points in the same direc- 
tion as the vector i+ j — k. 


Find three nonparallel vectors that are perpendicular 
toi—j+k. 

Is it ever the case that proj,b = proj,a? If so, under 
what conditions? 

Prove properties 2, 3, and 4 of dot products. 

Prove part 1 of Proposition 3.4. 


Suppose that a force F = i — 2j is acting on an object 
moving parallel to the vector a = 4i + j. Decompose F 
into a sum of vectors F; and F2, where F; points along 
the direction of motion and F; is perpendicular to the 
direction of motion. (Hint: A diagram may help.) 


25. 


26. 


27. 


In physics, when a constant force acts on an object 
as the object is displaced, the work done by the force 
is the product of the length of the displacement and 
the component of the force in the direction of the dis- 
placement. Figure 1.48 depicts an object acted upon by 
a constant force F, which displaces it from the point P 
to the point Q. Let 0 denote the angle between F and 
the direction of displacement. 

(a) Show that the work done by F is determined by the 

formula F - PO. 


(b) Find the work done by the (constant) force F = 
i+ 5j+ 2k in moving a particle from the point 
(1, —1, 1) to the point (2, 0, —1). 


P Q 


Component of F in direction 
of displacement 


Figure 1.48 A constant force F 
displaces the object from P to Q. (See 
Exercise 25.) 


A refrigerator is dragged 12 ft across a smooth floor 
using a rope and 60 lb of force directed along the rope. 
How much work is done if the rope makes a 20° angle 
with the horizontal? 


How much work is done in pushing a handtruck loaded 
with 500 lb of bananas 40 ft up a ramp inclined 30° 
from horizontal? 


Let a be a nonzero vector in R°. The direction cosines of a are 
the three numbers cos a, cos B, cos y determined by the angles 
a, B, y between a and, respectively, the positive x-, y-, and 
z-axes. In Exercises 28 and 29, find the direction cosines of the 
given vectors. 


28. 
29. 
30. 


31. 


a=i+2j—k 
a= 3i + 4k 


Ifa = aji + aj + azk, give expressions for the direc- 
tion cosines of a in terms of the components of a. 


Let A, B, and C denote the vertices of a triangle. Let 
0 <r < 1. If P; is the point on AB located r times the 
distance from A to B and P, is the point on AC located 
r times the distance from A to C, use vectors to show 
that P, P, is parallel to BC and has r times the length 
of BC. (This result generalizes that of Example 6 of 
this section.) 


32. 


33. 


34. 


35. 


Let A, B, C, and D be four points in R? such that no 
three of them lie on a line. Then ABCD is a quadri- 
lateral, though not necessarily one that lies in a plane. 
Denote the midpoints of the four sides of ABCD by 
Mı, M2, M3, and M4. Use vectors to show that, amaz- 
ingly, Mı M2 M; M; is always a parallelogram. 


Use vectors to show that the diagonals of a parallel- 
ogram have the same length if and only if the paral- 
lelogram is a rectangle. (Hint: Let a and b be vectors 
along two sides of the parallelogram. Express vectors 
running along the diagonals in terms of a and b. See 
Figure 1.49.) 


a 


Figure 1.49 Diagram for Exercise 33. 


Using vectors, prove that the diagonals of a parallelo- 
gram are perpendicular if and only if the parallelogram 
is a rhombus. (Note: A rhombus is a parallelogram 
whose four sides all have the same length.) 


This problem concerns three circles of equal radius r 
that intersect in a single point O. (See Figure 1.50.) 
(a) Let Wi, W2, and W3 denote the centers of the 
—> 
three circles and let w; = OW; for i = 1,2,3. 
Similarly, let A, B, and C denote the remaining 
— 
intersection points of the circles and seta = OA, 
—= — ; 
b = OB, andc = OC. By numbering the centers 
of the circles appropriately, write a, b, and ¢ in 
terms of w1, W2, and w3. 
(b) Show that A, B, and C lie on a circle of the 
same radius r as the three given circles. (Hint: 
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The center of the circle is at the point P, where 
— 
OP = wi + w + w3.) 


O 


Figure 1.50 Two examples 
of three circles of equal radius 
intersecting in a single 

point O. (See Exercise 35.) 


(c) Show that O is the orthocenter of triangle ABC. 
(The orthocenter of a triangle is the common in- 
tersection point of the altitudes perpendicular to 
the edges.) 


36. (a) Show that the vectors ||b||a-+ ||allb and ||blla — 


||al|b are orthogonal. 


(b) Show that ||b||a + ||al|b bisects the angle between 
a and b. 


1.4 The Cross Product 


The cross product of two vectors in R? is an “honest” product in the sense that it 
takes two vectors and produces a third one. However, the cross product possesses 
some curious properties (not the least of which is that it cannot be defined for 
vectors in R? without first embedding them in R? in some way) making it less 
“natural” than may at first seem to be the case. 

When we defined the concepts of vector addition, scalar multiplication, and 
the dot product, we did so algebraically (i.e., by a formula in the vector compo- 
nents) and then saw what these definitions meant geometrically. In contrast, we 
will define the cross product first geometrically, and then deduce an algebraic for- 
mula for it. This technique is more convenient, since the coordinate formulation 
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Figure 1.51 The area of this 
parallelogram is ||a|| ||b|| sin 8. 


b 


Figure 1.52 The right-hand 
rule for finding a x b. 


Figure 1.53 ix j=k. 


k 


Figure 1.54 A mnemonic for 
finding the cross product of the 
unit basis vectors. 


is fairly complicated (although we will find a way to organize it so as to make it 
easier to remember). 


The Cross Product of Two Vectors in R? 


DEFINITION 4.1 Let a and b be two vectors in R? (not R?). The cross 
product (or vector product) of a and b, denoted a x b, is the vector whose 
length and direction are given as follows: 


° The length of a x b is the area of the parallelogram spanned by a and b 
or is zero if either a is parallel to b or if a or b is 0. Alternatively, the 
following formula holds: 


lla x bl] = [lal] ||b]] sin 8, 
where 0 is the angle between a and b. (See Figure 1.51.) 


¢ The direction of a x b is such that a x b is perpendicular to both a and 
b (when both a and b are nonzero) and is taken so that the ordered triple 
(a,b,a x b) is a right-handed set of vectors, as shown in Figure 1.52. 
(if either a or b is 0, or if a is parallel to b, then a x b = 0 from the 
aforementioned length condition.) 


By saying that (a, b, a x b) is right-handed, we mean that if you let the fingers 
of your right hand curl from a toward b, then your thumb will point in the 
direction of a x b. 


EXAMPLE 1 Lets compute the cross product of the standard basis vectors 
for R°. First consider i x j as shown in Figure 1.53. The vectors i and j deter- 
mine a square of unit area. Thus, ||i x j|] = 1. Any vector perpendicular to both 
i and j must be perpendicular to the plane in which i and j lie. Hence, i x j 
must point in the direction of +k. The “right-hand rule” implies that i x j must 
point in the positive k direction. Since ||k|| = 1, we conclude that i x j = k. The 
same argument establishes that j x k = i and k x i = j. To remember these ba- 
sic equations, you can draw i, j, and k in a circle, as in Figure 1.54. Then the 
relations 


ixj=k, jxk=i, 


kxi=j 


may be read from the circle by beginning at any vector and then proceeding 
clockwise. + 


Properties of the Cross Product; Coordinate Formula 


Example 1 demonstrates that the calculation of cross products from the geometric 
definition is not entirely routine. What we really need is a coordinate formula, 
analogous to that for the dot product or for vector projections, which is not difficult 
to obtain. 
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From our Definition 4.1, it is possible to establish the following: 


Properties of the Cross Product. Leta, b, and c be any three vectors in R? 
and let k € R be any scalar. Then 

1. ax b= —b x a (anticommutativity); 

2. ax (b+ c¢)= a x b + a x ce (distributivity); 

3. (a+ b) x c= a x c+ b x ce (distributivity); 

4. k(a x b)= (ka) x b = a x (kb). 


We provide proofs of these properties at the end of the section, although you 
might give some thought now as to why they hold. It’s worth remarking that these 
properties are entirely reasonable, ones that we would certainly want a product to 
have. However, you should be clear about the fact that the cross product fails to 
satisfy other properties that you might also consider to be eminently reasonable. 
In particular, since property 1 holds, we see that a x b ~ b x a in general (i.e., 
the cross product is not commutative). Consequently, be very careful about the 
order in which you write cross products. Another property that the cross product 
does not possess is associativity. That is, 


a x (b x c) Æ (a x b) x c, 
in general. For example, let a = b = i and ¢ = j. Then 
ix(ixj)=ix k= —kxi= -j, 


from properties 1 and 4, but (i x i)x j= 0 x j= 0+ —j. (The equation 
i x i = 0 holds because i is, of course, parallel to i.) Make sure that you do 
not try to use an associative law when working problems. 

We now have the tools for producing a coordinate formula for the cross 
product. Let a = ai + a2j + a3k and b = bii + b2j + b3k. Then 


a x b = (ai + aj + a3K) x (bii + b2j + b3k) 


= (aji + aj + a3k) x bii + (aii + a2j + a3k) x bj 
+ (aii + aj + a3K) x b3k, 
by property 2, 
= abii x i + abıj x i + azbık x i + ahi x j + arb2j x j 
+a3bzk x j + a,b3i x k + a2b3j x k + a3b3k x k, 


by properties 3 and 4. These nine terms may look rather formidable at first, but 
we can simplify by means of the formulas in (1), anticommutativity, and the fact 
that e x e = 0 for any vector ¢ € R*. (Why?) Thus, 


ax b = —abik 4 azbij + abak — a3boi aib3j + a2b3i 
= (mab; — az3b2)i + (a3bı — aıb3)j + (aıb2 — a2b1)k. 2) 


EXAMPLE 2 Formula (2) gives 
(i +3j-— 2k) x (2i + 2k) = G+ 2 — (2) - 0i + (2-2-1: 2j 
+(1-0-—3-2)k 
= 6i — 6j — 6k. + 
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Formula (2) is more complicated than the corresponding formulas for all the 
other arithmetic operations of vectors that we’ve seen. Moreover, it is a rather dif- 
ficult formula to remember. Fortunately, there is a more elegant way to understand 
formula (2). We explore this reformulation next. 


Matrices and Determinants: A First Introduction 


A matrix is a rectangular array of numbers. Examples of matrices are 


3 
2 7|, and 
0 0 


oreo 
i) 


1 0 
0 1 
0 0 
0 0 


If a matrix has m rows and n columns, we call it “m x n” (read “m by n”). 
Thus, the three matrices just mentioned are, respectively, 2 x 3,3 x 2,and4 x 4. 
To some extent, matrices behave algebraically like vectors. We discuss some 
elementary matrix algebra in §1.6. For now, we are mainly interested in the notion 
of a determinant, which is a real number associated to ann x n (square) matrix. 
(There is no such thing as the determinant of a nonsquare matrix.) In fact, for the 
purposes of understanding the cross product, we need only study 2 x 2 and3 x 3 
determinants. 


DEFINITION 4.2 Let A bea2 x 2 or3 x 3 matrix. Then the determinant 
of A, denoted det A or | A], is the real number computed from the individual 
entries of A as follows: 


ea acaAse 
@ |p a b 
ra =|? a| hen Al = P 4| = ad — be 
o 3 3¢ 2 Gage 
@ i © 
IA = lla @ Ff |, tinea 
@ ip i 


ab e 
|AJ=|d e f |=aei+bfg + cdh — ceg —afh — bdi 
In a 


es if de 
Bs nae a j 


& 


e 
h 


in terms of 2 x 2 determinants. 


Perhaps the easiest way to remember and compute 2 x 2 and 3 x 3 determi- 
nants (but not higher-order determinants) is by means of a “diagonal approach.” 
We write (or imagine) diagonal lines running through the matrix entries. The 
determinant is the sum of the products of the entries that lie on the same diagonal, 
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where negative signs are inserted in front of the products arising from diagonals 
going from lower left to upper right: 


e 2 x 2 case 


and 
|A| = ad — bc. 


e 3 x 3 case (we need to repeat the first two columns for the method to work) 


b c 
e f 
h i 


Write 


Then 


|A| = aei + bfg + cdh — gec — hfa — idb. 


IMPORTANT WARNING This mnemonic device does not generalize beyond 
3 x 3 determinants. 


We now state the connection between determinants and cross products. 


Key Fact. Ifa = aji + aj + a3k and b = bii + b2j + b3k, then 
i j k 

k= o Gy G5 |. (3) 
bi bo b; 


Co a3). Gh Gaia a, a2 
bi b 


axb= oer i- By b j+ 


The determinants arise from nothing more than rewriting formula (2). Note 
thatthe 3 x 3 determinant in formula (3) needs to be interpreted by using the 2 x 2 
determinants that appear in formula (3). (The 3 x 3 determinant is sometimes 
referred to as a “symbolic determinant.”) 


EXAMPLE 3 
i j 
Gi+2j-k)x(i—-j+k)=|3 2 -1 
pai 4 
Fal EIE 
=| 24 i-li hli ac 
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We may also calculate the 3 x 3 determinant as 


= 2i— j — 3k — 2k — i — 3j = i — 4j — 5k. 


Areas and Volumes 


Cross products are used readily to calculate areas and volumes of certain objects. 
We illustrate the ideas involved with the next two examples. 


EXAMPLE 4 Let’s use vectors to calculate the area of the triangle whose ver- 
tices are A(3, 1), B(2, —1), and C(0, 2) as shown in Figure 1.55. 


y 
G 
A 
x x 
B 
Figure 1.55 Triangle ABC in Figure 1.56 Any triangle may be 
Example 4. considered to be half of a 


parallelogram. 


The trick is to recognize that any triangle can be thought of as half of a 
parallelogram (see Figure 1.56) and that the area of a parallelogram is obtained 
from a cross product. In other words, AB x AC isa vector whose length measures 
the area of the parallelogram determined by AB and AC , and so 


Area of AABC = 1\|AB x ACI. 


To use the cross product, we must consider AB and AC to be vectors in R°. This 
is straightforward: We simply take the k-components to be zero. Thus, 


AÈ = —i— 2j = —i — 2j — 0k, 


and 
AČ = -3i + j = -3i + j + 0k 
Therefore, 
_, i j k 
AË x AČ=| -1 -2 0|=-7k. 
—3 1 0 
Hence, 
Area of AABC = i||- 7k|| = 4. è 


Figure 1.57 The area of AABC 
is 7/2. 
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There is nothing sacred about using A as the common vertex. We could just 
as easily have used B or C, as shown in Figure 1.57. Then 


Area of AABC = 4||BA x BÈI = tG + 2) x (2i +3) = 4I7k|| = 2. 
EXAMPLE 5 Find a formula for the volume of the parallelepiped determined 
by the vectors a, b, and c. (See Figure 1.58.) 


A 
axb 


lel [cos 6l 


Figure 1.58 The parallelepiped determined by a, b, 
and c. 


As explained in §1.3, the volume of a parallelepiped is equal to the product 
of the area of the base and the height. In Figure 1.58, the base is the parallelogram 
determined by a and b. Hence, its area is ||a x b||. The vector a x b is perpendi- 
cular to this parallelogram; the height of the parallelepiped is ||e|| |cos 0|, where 8 
is the angle between a x b and c. (The absolute value is needed in case 0 > 77/2.) 
Therefore, 


Volume of parallelepiped = (area of base)(height) 
= ||a x b|| lel] [cos] 


= |(a x b)» cl. 


(The appearance of the cos@ term should alert you to the fact that dot products 
are lurking somewhere.) 
For example, the parallelepiped determined by the vectors 


a=i+5j, b= —4i+2j, and c=i+j+6k 
has volume equal to 
(G + Sj) x (—4i + 2j)) -G + j + 6k)| = |22k- (i+ j + 6k)| 
= |22(6)| 
= 132. + 
The real number (a x b) -c appearing in Example 5 is known as the triple 


scalar product of the vectors a, b, and c. Since |(a x b)» c| represents the volume 
of the parallelepiped determined by a, b, and c, it follows immediately that 


(a x b). c| = |(b x c)- al = |(c x a)- DI. 
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|F] ja 0 


Figure 1.59 Turning a bolt with a 
wrench. The torque on the bolt is 
the vector r x F. 


] 
Figure 1.60 A 


potato spinning 
about an axis. 


j 
Figure 1.61 The 


angular velocity 
vector @. 


In fact, if you are careful with the right-hand rule, you can convince yourself that 
the absolute value signs are not needed; that is, 
(a x b). c = (b x ¢)-a = (c x a) + b. (4) 

This is a nice example of how the geometric significance of a quantity can provide 
an extremely brief proof of an algebraic property the quantity must satisfy. (Try 
proving it by writing out the expressions in terms of components to appreciate 
the value of geometric insight.) 

We leave it to you to check the following beautiful (and convenient) formula 
for calculating triple scalar products: 


a, a a3 
(ax b)-c= bi bz b3 ; 
Cy C2 C3 


where a = aii + aj + a3k, b = bii + b2j + b3k, and ¢ = cii + c2j + c3k. 


Torque 


Suppose you use a wrench to turn a bolt. What happens is the following: You 
apply some force to the end of the wrench handle farthest from the bolt and that 
causes the bolt to move in a direction perpendicular to the plane determined by 
the handle and the direction of your force (assuming such a plane exists). To 
measure exactly how much the bolt moves, we need the notion of torque (or 
twisting force). 

In particular, letting F denote the force you apply to the wrench, we have 


Amount of torque = (length of wrench)(component of F L wrench). 


Let r be the vector from the center of the bolt head to the end of the wrench 
handle. Then 


Amount of torque = ||r|| ||F || sin 8, 


where 0 is the angle between r and F. (See Figure 1.59.) That is, the amount of 
torque is ||r x F||, and it is easy to check that the direction of r x F is the same 
as the direction in which the bolt moves (assuming a right-handed thread on the 
bolt). Hence, it is quite natural to define the torque vector T to be r x F. The 
torque vector T is a concise way to capture the physics of this situation. 

Note that if F is parallel to r, then T = 0. This corresponds correctly to the 
fact that if you try to push or pull the wrench, the bolt does not turn. 


Rotation of a Rigid Body - 


Spin an object (a rigid body) about an axis as shown in Figure 1.60. What is the 
relation between the (linear) velocity of a point of the object and the rotational 
velocity? Vectors provide a good answer. 

First we need to define a vector œ, the angular velocity vector of the rotation. 
This vector points along the axis of rotation, and its direction is determined by 
the right-hand rule. The magnitude of @ is the angular speed (measured in radians 
per unit time) at which the object spins. Assume that the angular speed is constant 
in this discussion. Next, fix a point O (the origin) on the axis of rotation, and let 
r(t) = OP be the position vector of a point P of the body, measured as a function 
of time, as in Figure 1.61. The velocity v of P is defined by 

Ar 


v= lim —, 
At>0 At 


Figure 1.62 A spinning rigid 
body. 


Figure 1.63 A carousel wheel. 
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where Ar = r(t + At) — r(t) (.e., the vector change in position between times t 
and t + At). Our goal is to relate v and w. 

As the body rotates, the point P (at the tip of the vector r) moves in a circle 
whose plane is perpendicular to œ. (See Figure 1.62, which depicts the motion 
of such a point of the body.) The radius of this circle is ||r(t)|| sin@, where 0 is 
the angle between œ and r. Both ||r(t)|| and 6 must be constant for this rotation. 
(The direction of r(t) may change with t, however.) If At ~ 0, then ||Ar|] is 
approximately the length of the circular arc swept by P between ¢ and t + At. 
That is, 


|| Ar|| ~ (radius of circle)(angle swept through by P) 
= (|r| sin@)(Ag) 


from the preceding remarks. Thus, 


Ar . Ag 
— || © |r|] sinô —. 
At At 


Now, let At > 0. Then Ar/At —> v and Ag/At > ||o|| by definition of the 
angular velocity vector w, and we have 


Ivi = loll [rl] sin@ = ||@ x rl. (5) 


It’s not difficult to see intuitively that v must be perpendicular to both @ and r. 
A moment’s thought about the right-hand rule should enable you to establish the 
vector equation 


V=oxFYr. (6) 


If we apply formula (5) to a bicycle wheel, it tells us that the speed of a point 
on the edge of the wheel is equal to the product of the radius of the wheel and 
the angular speed (6 is 2/2 in this case). Hence, if the rate of rotation is kept 
constant, a point on the rim of a large wheel goes faster than a point on the rim 
of a small one. In the case of a carousel wheel, this result tells you to sit on an 
outside horse if you want a more exciting ride. (See Figure 1.63.) 


Summary of Products Involving Vectors - 


Following is a collection of some basic information concerning scalar multipli- 
cation of vectors, the dot product, and the cross product: 


Scalar Multiplication: ka 


Result is a vector in the direction of a. 

Magnitude is ||Ka|| = |k] Ilall. 

Zero ifk = Oora=0. 

Commutative: ka = ak. 

Associative: k(la) = (kl)a. 

Distributive: k(a + b) = ka + kb; (k + l)a = ka + la. 
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Dot Product: a-b 


Result is a scalar. 

Magnitude is a- b = ||al| ||b|| cos 0; 0 is the angle between a and b. 
Magnitude is maximized if a || b. 

Zero ifa L b,a=0,orb=0. 

Commutative: a -b = b-a. 

Associativity is irrelevant, since (a+ b) +c doesn’t make sense. 
Distributive: a-(b+c)=a-b+a-c. 

Ifa = b, then a» a = |lall’. 


Cross Product: a x b 


Result is a vector perpendicular to both a and b. 

Magnitude is ||a x b|| = |jall ||b]|| sin 8; 6 is the angle between a and b. 
Magnitude is maximized ifa L b. 

Zero ifa || b, a = 0, or b = 0. 

Anticommutative: a x b = —b x a. 

Not associative: In general, a x (b x c) Æ (a x b) xc. 


Distributive: a x (b + ¢) = a x b + a x cand 
(a+b)xc=axec+bxe. 
Ifa L b, then ||a x b|| = |lal] || bl]. 


Addendum: Proofs of Cross Product Properties 


Proof of Property 1 To prove the anticommutativity property, we use the right- 
hand rule. Since 


lla x b|| = lial] || bl] sing, 


we obviously have that ||a x b|| = ||b x al]. Therefore, we need only understand 
the relation between the direction of a x b and that of b x a. To determine the 
direction of a x b, imagine curling the fingers of your right hand from a toward b. 
Then your thumb points in the direction of a x b. If instead you curl your fingers 
from b toward a, then your thumb will point in the opposite direction. This is the 
direction of b x a, so we conclude that a x b = —b x a. (See Figure 1.64.) E 


axb 


E 


Figure 1.64 The right-hand rule shows why a x b = —b x a. 
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Proof of Property 2 First, note the following general fact: 


PROPOSITION 4.3 Leta and b be vectors in R°. If a -x = b-x for all vectors x 
in R?, then a = b. 


To establish Proposition 4.3, write a as aji + a2j + a3k and b as bii + b2j + 
b3k and set x in turn equal to i, j, and k. Proposition 4.3 is valid for vectors in R? 
as well as R°. 

To prove the distributive law for cross products (property 2), we show that, 
for any x € Rô, 


(a x (b + ¢))-x = (a x b + a x ¢)- x. 


By Proposition 4.3, property 2 follows. 
From the equations in (4), 
(ax(b+c¢).-x=(xxa). (b+c) 
= (xx a)-b+(xxa)-ce, 


from the distributive law for dot products, 


= (ax b)-x+(axc)-x 
=(axb+axc)-x, 


again using (4) and the distributive law for dot products. E 


Proof of Property 3 Property 3 follows from properties 1 and 2. We leave the 
details as an exercise. E 


Proof of Property 4 The second equality in property 4 follows from the first 
equality and property 1: 


k(a x b) = —k(b x a) by property 1 
= —(kb) xa by the first equality of property 4 
= a x (kb) by property 1. 


Hence, we need only prove the first equality. 

If either a or b is the zero vector or if a is parallel to b, then the first equality 
clearly holds. Otherwise, we divide into three cases: (1) k = 0, (2) k > 0, and 
(3) k < 0. If k = 0, then both ka and k(a x b) are equal to the zero vector and 
the desired result holds. If k > 0, the direction of (ka) x b is the same as a x b, 
which is also the same as k(a x b). Moreover, the angle between ka and b is the 
same as between a and b. Calling this angle 0, we check that 


I|(Ka) x bl] = ||kal] ||b]| sin é 
= kjall ||b|| sind by part 1 of Proposition 3.4 
= k\la x bl by Definition 4.1 
= ||k(a x b)|| by part 1 of Proposition 3.4. 


We conclude (ka) x b = k(a x b) in this case. 
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b Ifk < 0, then the direction of (ka) x b is the same as that of (—a) x b, which 
is seen to be the same as that of —(a x b) and thus the same as that of k(a x b). 
m-0 The angle between ka and b is therefore m — 0, where 0 is the angle between a 
= 2 and b. (See Figure 1.65.) Thus, 
ka, k <0 ka, k>0 I|(Ka) x b|| = ||kall || bl] sinG@r — 6) = Ikl Jal] || bl] sin 6 = ||k(a x b)I]. 


Figure 1.65 Ifthe angle between 
a and b is 0, then the angle 
between ka and b is either 0 (if 

k > 0)orm — 90 (ifk < 0). 


1.4 Exercises 


Evaluate the determinants in Exercises 1—4. 


2 4 5 
TE | 9 5 
1 3 5 2 0 5 
3 0 2 7 
4 = 
-1 0 3 2 eo 
4 -8 2 


In Exercises 5—7, calculate the indicated cross products, using 


both formulas (2) and (3). 
5. (1,3, —2) x (-1,5, 7) 
6. (3i— 2j + k) x (i+j+ k) 
7. (i+ j) x (—3i+ 2j) 


8. Prove property 3 of cross products, using properties 1 
and 2. 


9. Ifa x b = 3i — 7j — 2k, what is (a+ b) x (a — b)? 
10. Calculate the area of the parallelogram having vertices 
(1, 1), (, 2), C, 3), and (—1, 2). 
11. Calculate the area of the parallelogram having vertices 
(1, 2, 3), (4, —2, 1), (—3, 1, 0), and (0, —3, —2). 
12. Find a unit vector that is perpendicular to both 2i + 
j— 3kandi+ k. 


13. If (a x b)» c = 0, what can you say about the geomet- 
ric relation between a, b, and c? 


Compute the area of the triangles described in Exercises 
14-17. 


14. The triangle determined by the vectors a = i + j and 
b=2i-j 

15. The triangle determined by the vectors a = i — 2j + 
6k and b = 4i + 3j — k 

16. The triangle having vertices (1,1), (—1,2), and 
(=2; m 1) 


17. The triangle having vertices (1,0, 1), (0, 2,3), and 
(—1, 5, —2) 


18. 


19. 


20. 


21. 
22. 


23. 


24. 


So, again, it follows that (ka) x b = k(a x b). | 


Find the volume of the parallelepiped determined by 
a = 3i — j, b = —2i + k, ande = i — 2j + 4k. 
What is the volume of the parallelepiped with vertices 
(3,0, —1), (4,2, —1), (-1, 1,0), (3, 1,5), (0,3, 0), 
(4, 3, 5), (—1, 2, 6), and (0, 4, 6)? 

a, a2 a 
Verify that (a x b)-c=| bı b2 by 

Cy CO G 
Show that (a x b)» c = a-(b X c) using Exercise 20. 
Use geometry to show that |(axb)-e|/= 
|b-(a xc). 
(a) Show that the area of the triangle with vertices 


P(x, y1), P2(x2, y2), and P3(x3, y3) is given by 
the absolute value of the expression 


1 1 1 
=| Xi X2 X3 
VI <Y2: Y3 


(b) Use part (a) to find the area of the triangle with 
vertices (1, 2), (2, 3), and (—4, —4). 

Suppose that a, b, and ¢ are noncoplanar vectors in R, 

so that they determine a tetrahedron as in Figure 1.66. 


c 


a 
Figure 1.66 The tetrahedron of 
Exercise 24. 


Give a formula for the surface area of the tetrahedron 
in terms of a, b, and c. (Note: More than one formula 
is possible.) 
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. Suppose that you are given nonzero vectors a, b, and c 
in R°. Use dot and cross products to give expressions for 
vectors satisfying the following geometric descriptions: 


(a) A vector orthogonal to a and b 

(b) A vector of length 2 orthogonal to a and b 

(c) The vector projection of b onto a 

(d) A vector with the length of b and the direction of a 

(e) A vector orthogonal to a and b x ¢ 

(f) A vector in the plane determined by a and b and 
perpendicular to c. 


Suppose a, b, c, and d are vectors in R?. Indicate which 
of the following expressions are vectors, which are 
scalars, and which are nonsense (i.e., neither a vector 
nor a scalar). 

(a) (ax b) xe 

(c) (a-b) x (e-d) 
(e) (a+b) x (c xd) 
(g) (a x b)» (c x d) 


(b) (a-b)-e 

(d) (a x b)-e 

(f) a x [(b - c)d] 
(h) (a+ b)e — (a x b) 


Exercises 27—32 concern several identities for vectors a, b, ¢, 


and d 


in R°. Each of them can be verified by hand by writing 


the vectors in terms of their components and by using formula 
(2) for the cross product and Definition 3.1 for the dot product. 
However, this is quite tedious to do. Instead, use a computer 
algebra system to define the vectors a, b, c, and d in general 
and to verify the identities. 


@ 27 


Q 28. 


@ 29. 


Q 30. 


@ 31. 
Q 32. 


33. 


34. 


35. 


. (a x b) x c = (a-c)b—(b-c)a 
a. (b xc) = b-(c X a) =c-(a xb) 
= —a-(c x b) = —c-(b x a) 


= —b-(axc) 
(a x b)» (c x d)=(a-c)(b- d) — (a: d)(b-c) 
_ | are a-d 
~ | bee bed 


(ax b)xe+(b xc) xXa+(c Xa) xX b= 0 (this is 
known as the Jacobi identity). 


(a x b) x (ec x d) = [a- (c x d)]b — [b-(c x d)la 
(a x b)» (b x ¢) x (c x a) = [a+ (b x c) 
Establish the identity 

(a x b)» (c x d) = (a-c)(b-d) — (a: d)(b-c) 


of Exercise 29 without resorting to a computer algebra 
system by using the results of Exercises 27 and 28. 


Egbert applies a 20 lb force at the edge of a 4 ft 
wide door that is half-open in order to close it. (See 
Figure 1.67.) Assume that the direction of force is per- 
pendicular to the plane of the doorway. What is the 
torque about the hinge on the door? 


Gertrude is changing a flat tire with a tire iron. The tire 
iron is positioned on one of the bolts of the wheel so 


36. 


37. 


39 
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|F|| = 20 Ib 


4 ft P 


Figure 1.68 The configuration for 
Exercise 35. 


that it makes an angle of 30° with the horizontal. (See 

Figure 1.68.) Gertrude exerts 40 lb of force straight 

down to turn the bolt. 

(a) If the length of the arm of the wrench is 1 ft, how 
much torque does Gertrude impart to the bolt? 


(b) What if she has a second tire iron whose length is 
18 in? 


Egbert is trying to open a jar of grape jelly. The ra- 
dius of the lid of the jar is 2 in. If Egbert imparts 15 1b 
of force tangent to the edge of the lid to open the jar, 
how many ft-lb, and in what direction, is the resulting 
torque? 


A 50 1b child is sitting on one end of a seesaw, 3 ft 
from the center fulcrum. (See Figure 1.69.) When she is 


Figure 1.69 The seesaw of Exercise 37. 


40 


38. 


39. 


40. 
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1.5 ft above the horizontal position, what is the amount 
of torque she exerts on the seesaw? 


For this problem, note that the radius of the earth is 

approximately 3960 miles. 

(a) Suppose that you are standing at 45° north latitude. 
Given that the earth spins about its axis, how fast 
are you moving? 

(b) How fast would you be traveling if, instead, you 
were standing at a point on the equator? 


Archie, the cockroach, and Annie, the ant, are on an 
LP record. Archie is at the edge of the record (ap- 
proximately 6 in from the center) and Annie is 2 in 
closer to the center of the record. How much faster is 
Archie traveling than Annie? (Note: A record playing 
on a turntable spins at a rate of 334 revolutions per 
minute.) 


A top is spinning with a constant angular speed of 12 
radians/sec. Suppose that the top spins about its axis 


41. 


of symmetry and we orient things so that this axis is 

the z-axis and the top spins counterclockwise about it. 

(a) If, at a certain instant, a point P in the top has 
coordinates (2, —1, 3), what is the velocity of the 
point at that instant? 


(b) What are the (approximate) coordinates of P one 
second later? 


There is a difficulty involved with our definition of 
the angular velocity vector @, namely, that we cannot 
properly consider this vector to be “free” in the sense 
of being able to parallel translate it at will. Consider 
the rotations of a rigid body about each of two parallel 
axes. Then the corresponding angular velocity vectors 
@, and @, are parallel. Explain, perhaps with a fig- 
ure, that even if @, and @, are equal as “free vectors,” 
the corresponding rotational motions that result must 
be different. (Therefore, when considering more than 
one angular velocity, we should always assume that the 
axes of rotation pass through a common point.) 


1.5 Equations for Planes; Distance Problems 


x 


Figure 1.70 The plane in R 
through the point Pp and 
perpendicular to the vector n. 


In this section, we use vectors to derive analytic descriptions of planes in R?. We 
also show how to solve a variety of distance problems involving “flat objects” 
(i.e., points, lines, and planes). 


Coordinate Equations of Planes 
A plane TI in R? is determined uniquely by the following geometric information: 
a particular point Po(xo, yo, Zo) in the plane and a particular vector n = Ai + 
Bj + Ck that is normal (perpendicular) to the plane. In other words, TI is the 
set of all points P(x, y, z) in space such that PoP is perpendicular to n. (See 
Figure 1.70.) This means that IT is defined by the vector equation 


Since PoP = (x — xo)i + (y — yo)j + (z — zo)k, equation (1) may be rewritten 


as 


(Ai + Bj + Ck)-((x — xo)i + (y — yo)j + (z — zo)k) = 0 


or 


A(x — xo) + Bly — yo) + CK 


Zo) = 0. 


This is equivalent to 
Ax+ By+Cz=D, 
where D = Axo + Byo + Czo. 
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EXAMPLE 1 Theplane through the point (3, 2, 1) withnormal vector 2i — j + 4k 
has equation 
(2i— j+ 4k)-((x — 3)i+ (vy — 2)j +(e — Dk) = 0 
=> 2(x — 3) -—(y —2) +42 -1) =0 
4> 2x -y+4z=8. + 


Not only does a plane in R? have an equation of the form given by equation 
(2), but, conversely, any equation of this form must describe a plane. Moreover, 
it is easy to read off the components of a vector normal to the plane from such an 
equation: They are just the coefficients of x, y, and z. 


EXAMPLE 2 Given the plane with equation 7x + 2y — 3z = 1, find a normal 
vector to the plane and identify three points that lie on that plane. 

A possible normal vector is n = 7i + 2j — 3k. However, any nonzero scalar 
multiple ofn will do just as well. Algebraically, the effect ofusing a scalar multiple 
of n as normal is to multiply equation (2) by such a scalar. 

Finding three points in the plane is not difficult. First, let y = z = 0 in the 
defining equation and solve for x: 


7x+2-0-3-0=1 = Tx=1l = x=}. 


Thus (5, 0, 0) is a point on the plane. Next, let x = z = 0 and solve for y: 


7-04+2y-3-0=1 4 y=}. 


So (0, 5, 0) is another point on the plane. Finally, let x = y = 0 and solve for z. 


You should find that (0, 0, — +) lies on the plane. + 


EXAMPLE 3 Put coordinate axes on R? so that the z-axis points vertically. 
Then a plane in R? is vertical if its normal vector n is horizontal (i.e., if n is 
parallel to the xy-plane). This means that n has no k-component, so n can be 
written in the form Ai + Bj. It follows from equation (2) that a vertical plane has 
an equation of the form 


A(x — xo) + B(y — yo) = 0. 
Hence, a nonvertical plane has an equation of the form 
A(x — xo) + BO — yo) + C(Z — zo) = 0, 
where C Æ 0. + 


EXAMPLE 4 From high school geometry, you may recall that a plane is 
determined by three (noncollinear) points. Let’s find an equation of the plane 
that contains the points Po(1, 2, 0), P)(3, 1, 2), and P2(0, 1, 1). 

There are two ways to solve this problem. The first approach is algebraic 
and rather uninspired. From the aforementioned remarks, any plane must have 
an equation of the form Ax + By + Cz = D for suitable constants A, B, C, and 
D. Thus, we need only to substitute the coordinates of Po, Pı, and P into this 
equation and solve for A, B, C, and D. We have that 


e substitution of Py gives A+2B = D; 
e substitution of P; gives 3A + B + 2C = D; and 
e substitution of P) gives B + C = D. 
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Figure 1.71 The plane 
determined by the points Po, Pi, 
and P2 in Example 4. 


Hence, we must solve a system of three equations in four unknowns: 


A+2B =D 
3A+ B+2C=D. (3) 
B+ C=D 


In general, such a system has either no solution or else infinitely many solutions. 
We must be in the latter case, since we know that the three points Po, P1, and Pz 
lie on some plane (i.e., that some set of constants A, B, C, and D must exist). 
Furthermore, the existence of infinitely many solutions corresponds to the fact 
that any particular equation for a plane may be multiplied by a nonzero constant 
without altering the plane defined. In other words, we can choose a value for one 
of A, B, C, or D, and then the other values will be determined. So let’s multiply 
the first equation given in (3) by 3, and subtract it from the second equation. We 
obtain 


A+ 2B = D 
=5B +2C==2D . (4) 
B+ C= D 
Now, multiply the third equation in (4) by 5 and add it to the second: 
A+2B = D 
1G=3D. (5) 
B+ C= D 
Multiply the third equation appearing in (5) by 2 and subtract it from the first: 
A —2C = -D 
7C = 3D. (6) 


Bae C= D 


By adding appropriate multiples of the second equation to both the first and third 
equations of (6), we find that 


7C= 3D. (7) 


Thus, if in (7) we take D = —7 (for example), then A = 1, B = —4, C = —3, 
and the equation of the desired plane is 
x — 4y —3z=-7. 

The second method of solution is cleaner and more geometric. The idea is 
to make use of equation (1). Therefore, we need to know the coordinates of a 
particular point on the plane (no problem—we are given three such points) and 

sm 

a vector n normal to the plane. The vectors Py P; and Po P» both lie in the plane. 
(See Figure 1.71.) In particular, the normal vector n must be perpendicular to 


them both. Consequently, the cross product provides just what we need. That is, 
we may take 


n= PoP; x PaP, = Qi- j+2k)x(-i—-j+k) 


i j k 
=| 2—1 2ļ|=i—4j-3k. 
T i 


y 


Figure 1.72 The line of 
intersection of the planes 
x—2y+z = 4and 

2x + y +3z = —7 in Example 5. 
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If we take Po(1, 2, 0) to be the particular point in equation (1), we find that the 
equation we desire is 


Gi — 4j — 3k) -((« — 1)i + (y — 2)j + zk) = 0 
or 
(x — 1)—4(y — 2) —3z =0. 
This is the same equation as the one given by the first method. + 


EXAMPLE 5 Consider the two planes having equations x — 2y + z = 4 and 
2x + y + 3z = —7. We determine a set of parametric equations for their line of 
intersection. (See Figure 1.72.) We use Proposition 2.1. Thus, we need to find a 
point on the line and a vector parallel to the line. To find the point on the line, 
we note that the coordinates (x, y, z) of any such point must satisfy the system of 
simultaneous equations given by the two planes 


(8) 


x—2y+z= 4 
2x +y +3z=-7 


From the equations given in (8), it is not too difficult to produce a single 
solution (x, y, z). For example, if we let z = 0 in (8), we obtain the simpler 


system 
x—2y= 4 
Ooo (9) 
2x+y=-7 
The solution to the system of equations (9) is readily calculated to be x = —2, 
y = —3. Thus, (—2, —3, 0) are the coordinates of a point on the line. 


To find a vector parallel to the line of intersection, note that such a vector 
must be perpendicular to the two normal vectors to the planes. The normal vectors 
to the planes are i — 2j + k and 2i + j + 3k. Therefore, a vector parallel to the 
line of intersection is given by 


(i—2j+k)x Qit+j 


Hence, Proposition 2.1 implies that a vector parametric equation for the line is 


3k) = -7i 


j+ 5k. 


r(t) = (—2i — 3j) + t(—7i — j + 5k), 
and a standard set of parametric equations is 
x=-—īt-—2 
y=-t—3 
zZ=5t + 


Parametric Equations of Planes 


Another way to describe a plane in R? is by a set of parametric equations. First, 
suppose that a = (a), a2, a3) and b = (bı, bo, b3) are two nonzero, nonparallel 
vectors in R?. Then a and b determine a plane in R? that passes through the 
origin. (See Figure 1.73.) To find the coordinates of a point P(x, y, z) in this 
plane, draw a parallelogram whose sides are parallel to a and b and that has two 
opposite vertices at the origin and at P, as shown in Figure 1.74. Then there must 


exist scalars s and ¢ so that the position vector of P is OP = sa + tb. The plane 
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Figure 1.75 The plane passing 
through Po(c1, c2, c3) and parallel 
to a and b. 


A\ 


y 
xX 
X 
Figure 1.73 The plane through Figure 1.74 For the point P in 
the origin determined by the the plane shown, O P = sa + tb 
vectors a and b. for appropriate scalars s and ¢. 


may be described as 
fx € R? |x=sa+tb;s,t €R}. 
Now, suppose that we seek to describe a general plane TI (i.e., one that does 
not necessarily pass through the origin). Let 
c= (ci; C2, ¢3) = OP 


denote the position vector of a particular point Pp in T and let a and b be two 
(nonzero, nonparallel) vectors that determine the plane through the origin parallel 
to II. By parallel translating a and b so that their tails are at the head of e (as in 
Figure 1.75), we adapt the preceding discussion to see that the position vector of 
any point P(x, y, z) in II may be described as 

OP =sa +tbh+e. 


To summarize, we have shown the following: 


PROPOSITION 5.1 A vector parametric equation for the plane TI containing 
the point Po(c1, c2, c3) (whose position vector is O Py = c) and parallel to the 
nonzero, nonparallel vectors a and b is 


x(s, t) = sa + tb +c. (10) 


By taking components in formula (10), we readily obtain a set of parametric 
equations for IT: 


Compare formula (10) with that of equation (1) in Proposition 2.1. We need 
to use two parameters s and ¢ to describe a plane (instead of a single parameter 
t that appears in the vector parametric equation for a line) because a plane is a 
two-dimensional object. 


Figure 1.76 A general 
configuration for finding the 
distance between a point and a 
line, using vector projections. 


Figure 1.77 Another general 
configuration for finding the 
distance between a point and a 
line. 
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EXAMPLE 6 We find a set of parametric equations for the plane that passes 
through the point (1, 0, — 1) and is parallel to the vectors 3i — k and 2i + 5j + 2k. 
From formula (10), any point on the plane is specified by 


x(s, t) = si—k) + t(2i + 5j + 2k) + (i — k) 
= (3s + 2t + l)i + 5tj + (2t — s — Dk. 


The individual parametric equation may be read off as 


x=3s+2t+1 
y=5t 
z=2t—s—1 + 


Distance Problems 


Cross products and vector projections provide convenient ways to understand a 
range of distance problems involving lines and planes: Several examples follow. 
What is important about these examples are the vector techniques for solving 
geometric problems that they exhibit, not the general formulas that may be derived 
from them. 


EXAMPLE 7 (Distance between a point and a line) We find the distance 
between the point Po(2, 1, 3) and the line I(t) = t(—1, 1, —2) + (2, 3, —2) in two 
ways. 


METHOD 1. From the vector parametric equations for the given line, we read 
off a point B on the line—namely, (2, 3, —2)—and a vector a parallel to the 
line—namely, a = (—1, 1, —2). Using Figure 1.76, the length of the vector 
B Po — proj, B Po provides the desired distance between Po and the line. Thus, 
we calculate that 


BP, = 2, 1,3) — (2,3, —2) 


=] (0, —2, 5); 
a BB 
proj, B Py = (? ) 
a'a 


- (2 1,1,—2)-(0, = 1,1, -2 
(—1,1,—2)-(—1, 1, —2) 


= (2, —2, 4). 
The desired distance is 
— . —> 
|| B Py — proj, B Poll = I0, —2, 5) — (2, —2, 4)ll = II(—2, 0, DI = v5. 


METHOD 2. In this case, we use a little trigonometry. If 0 denotes the angle 
between the vectors a and BP, as in Figure 1.77, then 


sin 0 = = $ 
|| B Poll 
where D denotes the distance between Pop and the line. Hence, 
ST lice —> 
lall |BPol|sind jax BP; 


llall llall 


D = ||BPy||sing = 
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Figure 1.78 The general 
configuration for finding the 
distance D between two parallel 
planes. 


Figure 1.79 Configuration for 
determining the distance between 
two skew lines in Example 9. 


Therefore, we calculate 


_ i j k 
axBP =| -1 1 —2|=i+5j+2k, 
0-2 5 


so that the distance sought is 


i j + 2k 30 
p- tSt _ V30_ ig 
|-i+j-2kl v6 


which agrees with the answer obtained by Method 1. + 


EXAMPLE 8 (Distance between parallel planes) The planes 
Il: 2x—2y+z=5 and Il: 2x—2y+z= 20 
are parallel. (Why?) We see how to compute the distance between them. 


Using Figure 1.78 as a guide, we see that the desired distance D is given by 


llproja, Pi È|, where P, is a point on Il, Pù is a point on Th, and n is a vector 
normal to both planes. 

First, the vector n that is normal to both planes may be read directly from 
the equation for either T or Th as n = 2i — 2j + k. It is not hard to find a point 
Pı on I: the point Pı (0, 0, 5) will do. Similarly, take P2(0, 0, 20) for a point on 
TI>. Then 


P; P> = (0, 0, 15), 


and calculate 


—> 
e PiP. 2,—2, 1). 1 
po FR = (2 : ‘a= (¢ 1)-@,0, aye 2,1) 


n-n (2, —2, 1). (2, —2, 1) 
= —2(2, —2, 1) 
= —3(2,—2, 1). 
Hence, the distance D that we seek is 
D = |iproj Pi | = ¿V9 =5. + 


EXAMPLE 9 (Distance between two skew lines) Find the distance between 
the two skew lines 


L) = t(2,1,3)+(0,5,—1) and b(t) = t(1,—1, 0) + (—1, 2, 0). 


(Two lines in R are said to be skew if they are neither intersecting nor parallel. 
It follows that the lines must lie in parallel planes and that the distance between 
the lines is equal to the distance between the planes.) 

To solve this problem, we need to find \|proj, B1 B> ||, the length of the projec- 
tion of the vector between a point on each line onto a vector n that is perpendicular 
to both lines, hence, also perpendicular to the parallel planes that contain the lines. 
(See Figure 1.79.) 

From the vector parametric equations for the lines, we read that the point 
B,(0, 5, —1) is on the first line and B2(—1, 2, 0) is on the second. Hence, 


B, Bp = (—1, 2, 0) — (0, 5, —1) = (-1, —3, 1). 


47 


1.5 | Exercises 


For a vector n that is perpendicular to both lines, we may use n = a, x a2, where 
a, = (2, 1, 3) isa vector parallel to the first line and az = (1, —1, 0) is parallel to 
the second. (We may read these vectors from the parametric equations.) Thus, 


i j k 
n=ajxa=|2 1 3|=3i+3j—3k, 
1 -l 0 
and so, 
—> 
. —> n- Bı B2 (—1, —3, 1). (3,3, —3) 
proja B1 B2 = n= (3,3, —3) 
n-n (3, 3, —3)+ (3, 3, —3) 
1 
= —3(3, 3, -3) 
= —4(1, 1, —1). 
The desired distance is ||proj, B1 B2|| = 3/3. + 
1.5 Exercises 
1. Calculate an equation for the plane containing the point 12. Find an equation for the plane that contains the two 
(3, —1, 2) and perpendicular to i — j + 2k. lines h:x =¢t+2,y =3t—5,z=5t+landh:x = 
; ; s ; 5—t,y=3t—10,z=9-—2t. 
2. Find an equation for the plane containing the point 
(9, 5, —1) and perpendicular to i — 2k. 13. Give a set of parametric equations for the line of inter- 
3. Find an equation for the plane containing the points — aes ee ee eet 
(3, —1, 2), (2, 0, 5), and (1, —2, 4). en 
4. Find an equation for the plane containing the points 1a, par Viale puna reooy ie a aa 
(A, 0, 0), (0, B, 0), and (0, 0, C). Assume that at least : ae, 2 at 1$ Deep are at totoe piang ee =y 
two of A, B, and C are nonzero. oo 
5. Give an equation for the plane that is parallel to the 13. A Ta tor cums Gi o = 6y FIAz= 
plane 5x — 4y +z = 1 and that passes through the BR ASL Pee Oke parae 
point (2, —1, —2). 16. Find values for A so that the planes Ax — y +z = 1 


10. 


11. 


. Give an equation for the plane parallel to the plane 2x — 


3y + z = 5 that passes through the point (—1, 1, 2). 


. Find an equation for the plane parallel to the plane x — 


y + 7z = 10 that passes through the point (—2, 0, 1). 


. Give an equation for the plane parallel to the plane 


2x +2y +z = 5 and that contains the line with para- 
metric equations x = 2 — t, y = 2t+1,z=3- 2t. 


. Explain why there is no plane parallel to the plane 


5x — 3y + 2z = 10 that contains the line with para- 
metric equations x = t +4, y = 3t — 2, z = 5 — 2t. 


Find an equation for the plane that contains the line x = 
2t — 1, y = 3t +4, z = 7 — t and the point (2, 5, 0). 


Find an equation for the plane that is perpendicular 
to the line x = 3t — 5, y = 7 — 2t, z = 8 — t and that 
passes through the point (1, —1, 2). 


and 3Ax + Ay — 2z = 5 are perpendicular. 


Give a set of parametric equations for each of the planes de- 
scribed in Exercises 17—22. 


17. 


18. 


19. 


20. 


21. 


The plane that passes through the point (—1, 2, 7) and 
is parallel to the vectors 2i — 3j + k and i — 5k 


The plane that passes through the point (2,9, —4) 
and is parallel to the vectors —8i + 2j + 5k and 
3i — 4j — 2k 


The plane that contains the lines lı: x = 2t + 5, y = 
—3t — 6, z = 4t + 10 and l: x = 5t — 1, y = 10t + 
3,z=7t—2 


The plane that passes through the three points (0, 2, 1), 
(7, —1, 5), and (—1, 3, 0) 


The plane that contains the line /:x = 3t — 5, y = 
10 — 3t, z = 2t + 9 and the point (—2, 4, 7) 
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22 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 
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. The plane determined by the equation 2x — 3y + 
5z =30 


Find a single equation ofthe form Ax + By + Cz = D 
that describes the plane given parametrically as x = 
3s —t +2, y = 4s +t,z =s + 5t + 3. (Hint: Begin 
by writing the parametric equations in vector form and 
then find a vector normal to the plane.) 


Find the distance between the point (1, —2, 3) and the 
line l: x = 2t — 5, y = 3 — t, z = 4. 


Find the distance between the point (2, —1) and the 
line /:x = 3t+7, y = 5t—3. 


Find the distance between the point (—11, 10, 20) and 
the line l: x =5-—t,y=3,z=7t+8. 


Determine the distance between the two lines lı (t) = 
t(8, —1,0)+(—1,3,5) and = bh(t)=7(0,3,1)+ 
(0, 3, 4). 


Compute the distance between the two lines 
l(t) = (t — 7)i+ (5t + Dj+G- 2k and b(t) = 
4ti+ (2 — t)j + (8t + Dk. 


(a) Find the distance between the two lines 1,(t) = 
t(3,1,2)+ (4,0,2) and b(t)=r(1,2,3)+ 
(2, 1, 3). 


(b) What does your answer in part (a) tell you about 
the relative positions of the lines? 


(a) The lines l(t) = t(1,—1,5)+ (2,0,—4) and 
h(t) = t(1,—1,5)+ (1, 3, —S) are parallel. Ex- 
plain why the method of Example 9 cannot be used 
to calculate the distance between the lines. 

(b) Find another way to calculate the distance. (Hint: 
Try using some calculus.) 


Find the distance between the two planes given by the 
equations x — 3y + 2z = 1 and x — 3y + 2z = 8. 


Calculate the distance between the two planes 


5x—2y+2z=12 and — 10x +4y—4z=8. 
Show that the distance d between the two parallel 


planes determined by the equations Ax + By + Cz = 


34. 


35. 


36. 


37. 


38. 


39. 


D, and Ax + By + Cz = Di is 
d= [Dı — D3| 
[AZ + B24 C2 
Two planes are given parametrically by the vector 
equations 


x1(s, t) = (—3, 4, —9) + 5(9, —5, 9) + #(3, —2, 3) 
xo(s, t) = (5, 0, 3) + s(—9, 2, —9) + t(—4, 7, —4). 


(a) Give a convincing explanation for why these 
planes are parallel. 


(b) Find the distance between the planes. 


Write equations for the planes that are parallel to 
x + 3y — 5z = 2 and lie three units from it. 


Suppose that lı (t) = ta + b; and L(t) = ta + by are 
parallel lines in either R? or R*. Show that the distance 
D between them is given by 
_ lla x (b2 — by) I 
llall 

(Hint: Consider Example 7.) 
Let TI be the plane in R? with normal vector n that 
passes through the point A with position vector a. If b 
is the position vector of a point B in R3, show that the 
distance D between B and II is given by 
_ [n+ (b= a) 

Iin] 


D 


Show that the distance D between parallel planes with 
normal vector n is given by 


|n + (x2 — x1 )| 
Iinl] 


’ 


where x, is the position vector of a point on one of the 
planes, and x, is the position vector of a point on the 
other plane. 


Suppose that lı (t) = ta; + bı andh(t) = ta + bz are 
skew lines in R3. Use the geometric reasoning of Ex- 
ample 9 to show that the distance D between these lines 
is given by 


p — (@ X a2) (be — bv 
llai x adl| 
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Vectors in R” 


The algebraic idea of a vector in R? or R? is defined in §1.1, in which we asked 
you to consider what would be involved in generalizing the operations of vector 
addition, scalar multiplication, etc., to n-dimensional vectors, where n can be 
arbitrary. We explore some of the details of such a generalization next. 
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DEFINITION 6.1 A vector in R” is an ordered n-tuple of real numbers. We 
use a = (a1, @, ..., an) as our standard notation for a vector in R”. 


EXAMPLE 1 The 5-tuple (2, 4, 6, 8, 10) is a vector in R. The (n + 1)-tuple 
(2n, 2n — 2, 2n — 4, . . . , 2, 0) is a vector in R”+!, where n is arbitrary. + 


Exactly as is the case in R? or R?, we call two vectors a = (a1, a2, ..., an) and 
b = (bı, b2, ..., bn) equal justin case a; = b; fori = 1, 2, ..., n. Vector addition 
and scalar multiplication are defined in complete analogy with Definitions 1.3 and 
1.4: If a = (a1, @, ..., an) and b = (by, bo, ..., b,) are two vectors in R” and 
k € R is any scalar, then 


a+b= (ai tbr art bares a, + dn) 
and 
ka = (ka, kan, ..., kan). 


The properties of vector addition and scalar multiplication given in $1.1 hold 
(with proofs that are no different from those in the two- and three-dimensional 
cases). Similarly, the dot product of two vectors in R” is readily defined: 


a.b = abı + arba +--+ + anbn. 


The dot product properties given in §1.3 continue to hold in n dimensions; we 
leave it to you to check that this is so. 

What we cannot do in dimensions larger than three is to develop a pictorial 
representation for vectors as arrows. Nonetheless, the power of our algebra and 
analogy does allow us to define a number of geometric ideas. We define the length 
of a vector in a € R” by using the dot product: 


lal = va. a. 
The distance between two vectors a and b in R” is 
Distance between a and b = ||a — b||. 


We can even define the angle between two nonzero vectors by using a generalized 
version of equation (4) of §1.3: 


ı a-b 


0 =cos ———. 
llall Ib] 


Here a, b € R” and 0 is taken so that 0 < 0 < z. (Note: At this point in our 
discussion, it is not clear that we have 


a-b 
<———_ <1, 
llall Ibl] 


which is a necessary condition if our definition of the angle @ is to make sense. 
Fortunately, the Cauchy—Schwarz inequality—formula (1) that follows—takes 
care of this issue.) Thus, even though we are not able to draw pictures of vectors 
in R”, we can nonetheless talk about what it means to say that two vectors are 
perpendicular or parallel, or how far apart two vectors may be. (Be careful about 
this business. We are defining notions of length, distance, and angle entirely in 
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terms of the dot product. Results like Theorem 3.3 have no meaning in R”, since 
the ideas of angles between vectors and dot products are not independent.) 
There is no simple generalization of the cross product. However, see Exer- 
cises 39—42 at the end of this section for the best we can do by way of analogy. 
We can create a standard basis of vectors in R” that generalize the i, j, 
k-basis in R°. Let 


e, =(1,0,0,...,0), 
e2 = (0, 1,0, ...,0), 


e, = (0,0,...,0, 1). 
Then it is not difficult to see (check for yourself) that 
a = (a1, 42, . . . , An) = aye; + a2€2 +--+ + anen. 


Here are two famous (and often handy) inequalities: 


Cauchy—Schwarz inequality. For all vectors a and b in R”, we have 


la-b| < |lall |b. 0) 


PROOF Ifn = 2 or 3, this result is virtually immediate in view of Theorem 3.3. 
However, in dimensions larger than three, we do not have independent notions of 
inner products and angles, so a different proof is required. 

First note that the inequality holds if either a or b is 0. So assume that a and 
b are nonzero. Then we may define the projection of b onto a just as in §1.3: 


-b 
projab = (==) a= ka. 
aca 


Here k is, of course, the scalar a: b/a-a. Let c = b — ka (so that b = ka + c). 
Then we have a-c = 0, since 


a.c = a. (b — ka) 


=a-b—ka-a 
ey 

=a-b—|—— Ja 
a-a 

=a-b—a-b 


We leave it to you to check that the “Pythagorean theorem” holds, namely, that 
the following equation is true: 


lb]? = k7\Jal]? + llell’. 
Multiply this equation by ||a||? = a- a. We obtain 
lal? bl? = jlall?k? lall? + lal? ell? 


2 
a-b 

2 2 2 Wn2 

= |lall (= z) llall- + llall llell 


Figure 1.80 The geometry 
behind the proof of the 
Cauchy—Schwarz inequality. 


Figure 1.81 The triangle 
inequality visualized. 
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aby 
= (a-a) =>) (a-a) + |lall? llel? 


= (a-b) + [jal]? ell’. 
Now, the quantity ||a||*||c¢||? is nonnegative. Hence, 
lal? lb]? > (a +b}. 

Taking square roots in this last inequality yields the result desired. E 


The geometric motivation for this proof of the Cauchy—Schwarz inequality 
comes from Figure 1.80.! 


The triangle inequality. For all vectors a, b € R” we have 


lla + bl] < llall + [|b]. (2) 


PROOF Strategic use of the Cauchy—Schwarz inequality yields 
la + b||? = (a+b)-(a+b) 
=a-a+2a-b+b-b 


< a-a + 2ljal] |bl| +b-b by (1) 
= |iall? + 2lla]i Ibl] + Ibl]? 
= (llall + lbI)? . 
Thus, the result desired holds by taking square roots, since the quantities on both 
sides of the inequality are nonnegative. E 


In two or three dimensions the triangle inequality has the following obvious 
proof from which the inequality gets its name: Since |lal|, ||b||, and ||a + b|| can 
be viewed as the lengths of the sides of a triangle, inequality (2) says nothing 
more than that the sum of the lengths of two sides of a triangle must be at least 
as large as the length of the third side, as demonstrated by Figure 1.81. 


Matrices 


We had a brief glance at matrices and determinants in §1.4 in connection with the 
computation of cross products. Now it’s time for another look. 

A matrix is defined in §1.4 as a rectangular array of numbers. To extend our 
discussion, we need a good notation for matrices and their individual entries. We 
used the uppercase Latin alphabet to denote entire matrices and will continue to 
do so. We shall also adopt the standard convention and use the lowercase Latin 
alphabet and two sets of indices (one set for rows, the other for columns) to 
identify matrix entries. Thus, the general m x n matrix can be written as 


ail 412 *** Gin 
421 422 *** An 


A= j a fi = (shorthand) (a;;). 
| Amı Am2 Amn | 


' See J. W. Cannon, Amer: Math. Monthly 96 (1989), no. 7, 630-631. 
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The first index always will represent the row position and the second index, the 
column position. 

Vectors in R” can also be thought of as matrices. We shall have occasion to 
write the vector a = (a1, a2, ..., an) either as a row vector (a 1 x n matrix), 


a=[ a Ma an |, 


or, more typically, as a column vector (an n x 1 matrix), 


an 


We did not use double indices since there is only a single row or column present. 
It will be clear from context (or else indicated explicitly) in which form a vector 
a will be viewed. An m x n matrix A can be thought of as a “vector of vectors” 
in two ways: (1) as m row vectors in R”, 


| [ai an +++ ain | 
Â= [ an LA an | , 


[ Amı Am2 *** Amn ] 


or (2) as n column vectors in R”, 


a1 a2 din 

a21 a22 a2n 
A= al tee 

Gm Am2 Amn 


We now define the basic matrix operations. Matrix addition and scalar mul- 
tiplication are really no different from the corresponding operations on vectors 
(and, moreover, they satisfy essentially the same properties). 


DEFINITION 6.2 (MATRIX ADDITION) Let A and B betwom x n matrices. 
Then their matrix sum A + B is the m x n matrix obtained by adding cor- 
responding entries. That is, the entry in the ith row and jth column of A + B 
is aj; + bij, where a;; and b;; are the ijth entries of A and B, respectively. 


EXAMPLE 2 If 


then 
8 2 2 
ssee[$ 2 2]. 


: : | then A + B is not defined, since B does not have 


the same dimensions as A. + 


However, if B = 
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Properties of matrix addition. For all m x n matrices A, B, and C we have 
1. A+ B = B + A (commutativity); 
2. A+(B+C)=(A+ B)+C (associativity); 


3. Anm x n matrix O (the zero matrix) with the property that A + O = A 
for all m x n matrices A. 


DEFINITION 6.3 (SCALAR MULTIPLICATION) If A is an m x n matrix and 
k €R is any scalar, then the product kA of the scalar k and the matrix A is 
obtained by multiplying every entry in A by k. That is, the ijth entry of kA 
is kaj; (where a;j is the ijth entry of A). 


1 2 3 3 6 9 
EXAMPLE 3 ta=[; 5 a tensas] 3 15 ral + 


Properties of scalar multiplication. If A and B are any m x n matrices 
and k and / are any scalars, then 

1. (k+ DA = kA + lA (distributivity); 

2. k(A + B) = kA + kB (distributivity); 

3. k(LA) = (kI)A = I(kA). 


We leave it to you to supply proofs of these addition and scalar multiplication 
properties if you wish. 

Just as defining products of vectors needed to be “unexpected” in order to be 
useful, so it is with defining products of matrices. To a degree, matrix multipli- 
cation is a generalization of the dot product of two vectors. 


DEFINITION 6.4 (MATRIX MULTIPLICATION) Let A be an m x n matrix 
and B ann x p matrix. Then the matrix product AB is the m x p matrix 
whose ijth entry is the dot product of the ith row of A and the jth column 
of B (considered as vectors in R”). That is, the ijth entry of 


: ieee bij aa Dip 
. s bo bo; bop 
ie eae Grol . : : 


m1 Am2  *** Amn 


Gi Gi 992 Cip 


is 


a;1b,; + aj2b2; +--+ + Ginbpj = (more compactly) aikbkj- 
=A 
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EXAMPLE 4 If 


0 1 
a=| - l ad Bale 0), 
2 4 


then the (2, 1) entry of AB is the dot product of the second row of A and the first 
column of B: 


(2, 1) entry = [ 4 5 6 ] >| 7 | = (40) + (5X7) + (6X2) = 47. 
2 
The full product AB is the 2 x 2 matrix 
20 B3 
47 28’ 


On the other hand, BA is the 3 x 3 matrix 


4 5 6 
7 14 21 j. 
18 24 30 + 


Order matters in matrix multiplication. To multiply two matrices we must 
have 


Number of columns of left matrix = number of rows of right matrix. 


In Example 4, the products AB and BA are matrices of different dimensions; 
hence, they could not possibly be the same. A worse situation occurs when the 
matrix product is defined in one order and not the other. For example, if A is 2 x 3 
and B is 3 x 3, then AB is defined (and isa 2 x 3 matrix), but B A is not. However, 
even if both products AB and BA are defined and of the same dimensions (as is 
the case if A and B are both n x n, for example), it is in general still true that 


AB + BA. 


Despite this negative news, matrix multiplication does behave well in a number 
of respects, as the following results indicate: 


Properties of matrix multiplication. Suppose A, B, and C are matrices 
of appropriate dimensions (meaning that the expressions that follow are all 
defined) and that k is a scalar. Then 

1. A(BC) = (AB)C; 

2. k(AB) = (KA)B = A(kB); 

Be ANUS T OS Aes ae AVC? 

4. (A+B)C=AC+ BC. 


The proofs of these properties involve little more than Definition 6.4, although 
the notation can become somewhat involved, as in the proof of property 1. 


One simple operation on matrices that has no analogue in the real number 
system is the transpose. The transpose ofan m x n matrix A is then x m matrix 
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A’ obtained by writing the rows of A as columns. For example, if 


1 4 
a=| : al then A=] 2 5 
3 6 


More abstractly, the ijth entry of A’ is a;;, the jith entry of A. 
The transpose operation turns row vectors into column vectors and vice versa. 
We also have the following results: 


(AT) =A, for any matrix A. (3) 


(AB) = BTA", where A ism x n and B isn x p. (4) 


The transpose will largely function as a notational convenience for us. For 
example, consider a, b € R” to be column vectors. Then the dot product a+b can 
be written in matrix form as 

bı 
bz 
a-b = aibi + ab +--+ abn = | a1 az ==> an ] i =a’b. 


bn 
EXAMPLE 5 Matrix multiplication is defined the way it is so that, roughly 
speaking, working with vectors or quantities involving several variables can be 
made to look as much as possible like working with a single variable. This idea 
will become clearer throughout the text, but we can provide an important example 
now. A linear function in a single variable is a function of the form f(x) = ax 
where a is a constant. The natural generalization of this to higher dimensions 
is a linear mapping F: R” —> R”, F(x) = Ax, where A is a (constant) m x n 
matrix and x € R”. More explicitly, F is a function that takes a vector in R” 


(written as a column vector) and returns a vector in R” (also written as a column). 
That is, 


ail A12 +++ Ain x) 

421 @22 ++: An X2 
F(x) = Ax = . : 

Gm1 Am2 *** Amn Xn 


The function F has the properties that F(x + y) = F(x) + F(y) for all x, y € R” 
and F(kx) = kF(x) for all x € R”, k € R. These properties are also satisfied by 
f(x) = ax, of course. Perhaps more important, however, is the fact that linear 
mappings behave nicely with respect to composition. Suppose F is as just defined 
and G: R” — R” is another linear mapping defined by G(x) = Bx, where B is a 
p x m matrix. Then there is a composite function G o F: R” — R? defined by 


G o F(x) = G(F(x)) = G(Ax) = B(Ax) = (BA)x 


by the associativity property of matrix multiplication. Note that BA is defined 
and is a p x n matrix. Hence, we see that the composition of two linear mappings 
is again a linear mapping. Part of the reason we defined matrix multiplication the 
way we did is so that this is the case. + 


EXAMPLE 6 We saw that by interpreting equation (1) in §1.2 in n dimensions, 
we obtain parametric equations of a line in R”. Equation (2) of §1.5, the equation 
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for a plane in R? through a given point (xo, yo, zo) with given normal vector 
n = Ai + Bj + Ck, can also be generalized to n dimensions: 


Ay(%1 — b1) + A2(x2 — b2) + +++ + An(in — bn) = 9. 
If we let A = (A1, A2, ..., An), b = (b1, b2, ..., bn) “constant” vectors), and 
x = (x1, X2,..., Xn) (a “variable” vector), then the aforementioned equation can 


be rewritten as 
A-(x—b)=0 
or, considering A, b, and x as x | matrices, as 
A’ (x —b)=0. 


This is the equation for a hyperplane in R” through the point b with normal 
vector A. The points x that satisfy this equation fill out an (n — 1)-dimensional 
subset of R”. + 


At this point, it is easy to think that matrix arithmetic and the vector geometry 
of R”, although elegant, are so abstract and formal as to be of little practical use. 
However, the next example, from the field of economics,? shows that this is not 
the case. 


EXAMPLE 7 Suppose that we have n commodities. If the price per unit of the 
ith commodity is p;, then the cost of purchasing x; (> 0) units of commodity 
i is pixi. If p = (pı, ..., Pn) is the price vector of all the commodities and 
X = (X1, -. - , Xn) is the commodity bundle vector, then 


PX = pX + P2xX2 +++ + PnXn 
represents the total cost of the commodity bundle. 

Now suppose that we have an exchange economy, so that we may buy and 
sell items. If you have an endowment vector w = (w1, ..., Wn), where w; is the 
amount of commodity i that you can sell (trade), then, with prices given by the 
price vector p, you can afford any commodity bundle x where 

p-x <p-w. 
We may rewrite this last equation as 
p:(x—w) <0. 
In other words, you can afford any commodity bundle x in the budget set 
{x | p+ (x — w) < 0}. The equation p - (x — w) = 0 defines a budget hyperplane 
passing through w with normal vector p. + 


Determinants 


We have already defined determinants of 2 x 2 and 3 x 3 matrices. (See §1.4.) 
Now we define the determinant of any n x n (square) matrix in terms of determi- 
nants of (n — 1) x (n — 1) matrices. By “iterating the definition,” we can calculate 
any determinant. 


2 See D. Saari, “Mathematical complexity of simple economics,” Notices of the American Mathematical 
Society 42 (1995), no. 2, 222-230. 
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DEFINITION 6.5 Let A = (a;;) be an n x n matrix. The determinant of 
A is the real number given by 


IA] = C D'anlAnl + (1) ap lA +--+ (1) ail Ail, 


where Aj; is the (n — 1) x (n — 1) submatrix of A obtained by deleting the 
ith row and jth column of A. 


1 i 3 
—2 i 0 5 
EXAMPLE 8 IfA=| 4 4 _, 9 |-then 
3 =2 1 
-2 5 
TE 2 T ee ea D 
i =l 3 1 1 
3 — 1 1 
According to Definition 6.5, 
2105 1 os 
det =(-1)'*(1)det} 2 -1 0 
4 2—1 0 5 141 
3 —2 1 4 
—2 5 
+1 2de) 4 -1 0 
3 1 1 
= f 5 
+(-1)'(1)det] 4 2 0 
a2 f 
—2 1 
+(—1)!+43)det| 4 2.1 
3 —2 1 


= (DDED + = DGT + AA- 78) 
+(-)GB\-7) 
= —132. + 


The determinant of the submatrix A;; of A is called the i jth minor of A, and 
the quantity (—1)'*/|A;;| is called the ijth cofactor. Definition 6.5 is known as 
cofactor expansion of the determinant along the first row, since det A is written 
as the sum of the products of each entry of the first row and the corresponding 
cofactor (i.e., the sum of the terms aj; times (—1)'*/|A;;|). 

It is natural to ask if one can compute determinants by cofactor expansion 


along other rows or columns of A. Happily, the answer is yes (although we shall 
not prove this). 
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Convenient Fact. The determinant of A can be computed by cofactor ex- 
pansion along any row or column. That is, 


IA] = (C Dian lAn] + (tanli + + (HD ain Ain 
(expansion along the ith row), 
IA] = (D'a jlAijl + (1)? a2j|Aaj| + + (HD nj |Anj 


(expansion along the jth column). 


EXAMPLE 9 To compute the determinant of 


1 2 4 5 
E | 

-1 0j, 

0 2 | 

0 0 0 
expansion along the first row involves more calculation than necessary. In partic- 
ular, one would need to calculate four 4 x 4 determinants on the way to finding 
the desired 5 x 5 determinant. (To make matters worse, these 4 x 4 determinants 


would, in turn, need to be expanded also.) However, if we expand along the third 
column, we find that 


O= Og 
O 
© 


— 
won 
=. NUNS 


det A = (—1)!*3(0) det A13 + (—1)?*7(0) det A23 + (—1)**3(1) det A33 
+ (—1)**9(0) det Ags + (—1)°*9(0) det Ass 


= det A33 
1 2 4 5 
_|2 0 9 0 
oO 2 0 2 
3 1 0 0 


There are several good ways to evaluate this 4 x 4 determinant. We’ll expand 
about the bottom row: 


: 7 : ; 2 4 5 1 4 = 5 

=(-1)'3)}0 9 O}4+(CD*?d)|2 9 0 
C2 © 2 2 0 32 0 0 2 
3 1 0 0 


= (-)GB)(-S4) + (DE) 
= 164. + 


Of course, not all matrices contain well-distributed zeros as in Example 9, 
so there is by no means always an obvious choice for an expansion that avoids 
much calculation. Indeed, one does not compute determinants of large matrices 
by means of cofactor expansion. Instead, certain properties of determinants are 
used to make hand computations feasible. Since we shall rarely need to consider 
determinants larger than 3 x 3, we leave such properties and their significance to 
the exercises. (See, in particular, Exercises 26 and 27.) 


1.6 Exercises 


1. 


Rewrite in terms of the standard basis for R”: 

(a) (1,2,3,...,7) 

(b) (1,0, —1, 1,0, —-1,..., 1,0, —1) (Assume that n 
is a multiple of 3.) 


In Exercises 2—4 write the given vectors without recourse to 
standard basis notation. 


2. 
3. e; — 2e2 + 3e3 — 4ey+---+(—-1)"*!ne, 
4. 
5 


10. 


11. 


12. 


13. 


14. 


e Fez +++ Ten 


e; +e, 


. Calculate the following, where a=(1,3,5,..., 


2n — 1) and b = (2, —4, 6,..., (—1)"*!2n): 
(a) a+b (b) a— b (c) —3a 
(d) Ilall (e) a-b 


. Let n be an even number. Verify the triangle in- 


equality in R” for a= (1,0, 1,0,...,0) and b= 
(0,1,0,1,..., 1). 


. Verify that the Cauchy—Schwarz inequality holds for 


the vectors a = (1, 2,...,n)and b = (1, 1,..., 1). 


. Ifa = (1, —1, 7,3, 2) and b = (2, 5, 0, 9, —1), calcu- 


late the projection proj, b. 


. Show, for all vectors a, b, c € R”, that 


lla — bl] < lla — ell + lle — b||. 
Prove the Pythagorean theorem. That is, if a, b, and 
c are vectors in R” such that a + b =c anda-b = 0, 
then 


lal? + Ibl? = llel’. 
Why is this called the Pythagorean theorem? 


Let a and b be vectors in R”. Show that if ||a + b|| = 
||a — b||, then a and b are orthogonal. 


Let a and b be vectors in R”. Show that if ||a — b|| > 
||a + b||, then the angle between a and b is obtuse (i.e., 
more than 7/2). 


Describe “geometrically” the set of points in R> satis- 
fying the equation 


2(x1 — 1) + 3(x2 + 2) — 7x3 + x4 —4— 5(x5 + 1) =0. 


To make some extra money, you decide to print four 
types of silk-screened T-shirts that you sell at various 
prices. You have an inventory of 20 shirts that you can 
sell for $8 each, 30 shirts that you sell for $10 each, 24 
shirts that you sell for $12 each, and 20 shirts that you 
sell for $15 each. A friend of yours runs a side business 
selling embroidered baseball caps and has an inventory 


15. 


59 


1.6 | Exercises 


of 30 caps that can be sold for $10 each, 16 caps that 
can be sold for $10 each, 20 caps that can be sold for 
$12 each, and 28 caps that can be sold for $15 each. 
You suggest swapping half your inventory of each type 
of T-shirt for half his inventory of each type of baseball 
cap. Is your friend likely to accept your offer? Why or 
why not? 


Suppose that you run a grain farm that produces six 
types of grain at prices of $200, $250, $300, $375, 
$450, $500 per ton. 

(a) Ifx = (x1, ..., x6) is the commodity bundle vec- 
tor (meaning that x; is the number of tons of grain 
i to be purchased), express the total cost of the 
commodity bundle as a dot product of two vectors 
in R6. 

(b) A customer has a budget of $100,000 to be used 
to purchase your grain. Express the set of possible 
commodity bundle vectors that the customer can 
afford. Also describe the relevant budget hyper- 
plane in R6. 


In Exercises 16-19, calculate the indicated matrix quantities 


where 


16. 
18. 
20. 


1-1 0 

C=|2 0 7], D= | A 5 
0 3 -2 

3A—2B 17. AC 

DB 19. BTD 


Then x n identity matrix, denoted / or J,,, is the ma- 
trix whose iith entry is 1 and whose other entries are 
all zero. That is, 


1 0 0 

0 1 0 
In = 7 

0 0 1 


(a) Explicitly write out Jy, 13, and 14. 


(b) The reason 7 is called the identity matrix is that 
it behaves as follows: Let A be any m x n matrix. 
Then 


i. Al, = A. 

ii. ImA = A. 
Prove these results. (Hint: What are the ijth entries 
of the products in (i) and (ii)?) 
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Evaluate the determinants given in Exercises 21—23. 


7 0-1 0 
2 0 1 3 
21./; 3 o 2 
0 5 1-2 
8 0 0 0 
is 1 0 0 
22.| 73 6-1 0 
$ 1 7 
5—1 0 8 1l 
02 1 9 7 
23./0 0 4 3 
00 02 1 
0 0 0 0 -3 


24. Prove that a matrix that has a row or a column con- 
sisting entirely of zeros has determinant equal to 


Zero. 


25. An upper triangular matrix is ann x n matrix whose 
entries below the main diagonal are all zero. (Note: 
The main diagonal is the diagonal going from upper 


left to lower right.) For example, the matrix 


1 2-1 2 
0 3 4 3 
0 0 5 6 
0 0 O 7 


is upper triangular. 


(a) Give an analogous definition for a lower triangu- 


lar matrix and also an example of one. 


(b) Use cofactor expansion to show that the determi- 
nant of any n x n upper or lower triangular matrix 
A is the product of the entries on the main diagonal. 


That is, det A = 411422 ` ` ` nn. 


26. Some properties of the determinant. Exercises 24 
and 25 show that it is not difficult to compute de- 
terminants of even large matrices, provided that the 
matrices have a nice form. The following operations 
(called elementary row operations) can be used to 
transform an n x n matrix into one in upper triangular 


form: 
I. Exchange rows i and j. 
H. Multiply row i by a nonzero scalar. 


HI. Add a multiple of row i to row j. (Row i remains 


unchanged.) 
For example, one can transform the matrix 


0 2 3 
1 7 =2 
5 9 


into one in upper triangular form in three steps: 


Step 1. Exchange rows 1 and 2 (this puts a nonzero 
entry in the upper left corner): 


0 2 3 1 7 =2 
I 7 =2 — 0 2 3 
1 5 9 1 5 9 


Step 2. Add —1 times row 1 to row 3 (this eliminates 
the nonzero entries below the entry in the upper left 


corner): 
1 7 -2 1 7 -2 
0 2 3 — 0 2 3 
1 5 9 0 —2 Ii 


Step 3. Add row 2 to row 3: 


1 7 =2 1 7 -2 
0 2 3 — 0 2 3 
0 —2 1I 0 0 14 


The question is, how do these operations affect the de- 
terminant? 

(a) By means of examples, make a conjecture as to the 
effect of a row operation of type I on the determi- 
nant. (That is, if matrix B results from matrix A by 
performing a single row operation of type I, how are 
det A and det B related?) You need not prove your 
results are correct. 


(b) Repeat part (a) in the case of a row operation of 
type III. 


(c) Prove that if B results from A by multiplying the 
entries in the ith row of A by the scalar c (a type II 
operation), then det B = c - det A. 


27. Calculate the determinant of the matrix 


2 1-2 T 8 

1 0 1-2 4 

A=| -l 1 2 3 -§ 
0 2 3 1 7 

=3. 2 =) -0 1 


by using row operations to transform A into a matrix 
in upper triangular form and by using the results of 
Exercise 26 to keep track of how the determinant of A 
and the determinant of your final matrix are related. 


28. (a) Is det(A + B) = det A + det B? Why or why not? 
(b) Calculate 


1 2 7 
342 1-1 5+1 
0 —2 0 
and 
1 2 7 1 2 


3 1 Sj+);2 -1 1], 
0 —2 0 0-2 0 


and compare your results. 


(c) Calculate 


1 3 243 
0 4 -145 
—1 0 0-2 
and 
1 3 2 1 3 3 
0 4 -1/+ 0 4 #54, 
—1 0 0 —1 0 -2 


and compare your results. 


(d) Conjecture and prove a result about sums of deter- 
minants. (You may wish to construct further exam- 
ples such as those in parts (b) and (c).) 


29. Itis a fact that, if A and B are any n x n matrices, then 


det(A B) = (det A)(det B). 


Use this fact to show that det(A B) = det(B A). (Recall 
that AB Æ BA, in general.) 


Ann x n matrix A is said to be invertible (or nonsingular) if 
there is another n x n matrix B with the property that 


AB = BA= h, 


where I, denotes the n x n identity matrix. (See Exercise 20.) 
The matrix B is called an inverse to the matrix A. Exercises 
30-38 concern various aspects of matrices and their inverses. 


30. 


31. 


32. 


33. 


34. 


(a) Verify that | A | is an inverse or E i | 


12 3 
(b) Verify that 2 is an inverse of 
1 0 8 
—40 16 9 
13-5 -3 
5 —2 -1 


Using the definition of an inverse matrix, find an 


2 2 1 
inverseto| 0 1 0 
0 0 -l 
0 2 1 
Try to find an inverse matrix to | 0 1 0 
0 0 -l 
What happens? 


Show that ifan n x n matrix A is invertible, then A can 
have only one inverse matrix. Thus, we may write A~! 
to denote the unique inverse of a nonsingular matrix 
A. (Hint: Suppose A were to have two inverses B and 
C. Consider B(AC).) 


Suppose that A and B are n x n invertible matrices. 
Show that the product matrix AB is invertible by ver- 
ifying that its inverse (AB)! = B7! A™!. 
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35. (a) Show that if A is invertible, then det A 4 0. (In 
fact, the converse is also true.) 


(b) Show that if A is invertible, then 


1 
det(A7!) = det A’ 


36. (a) Show that, if ad — bc # 0, then a general 2 x 2 


; a b é 
matrix a | has the matrix 


1 d —b = aa a 
ad—be| —-¢ aj | -am TET 


as inverse. 


(b) Use this formula to find an inverse of | E l 

37. If A is a 3 x 3 matrix and det A Æ 0, then there is 
a (somewhat complicated) formula for A~!. In 
particular, 


[Aul —|Aal = |Aail 
—|Aj2| |Az2| —|Az2l |, 
|A] —|A23|  lA33l 


_ 1 
~ det A 


where A;; denotes the submatrix of A obtained by 
deleting the ith row and jth column (see Defini- 
tion 6.5). Use this formula to find the inverse of 


> 
II 
=. oN 
© U s= 
wpe 


More generally, if A is any n x n matrix and det A Æ 0, 
then 


E 1 
mea a 


where adj A is the adjoint matrix of A, that is, the 
matrix whose ijth entry is (—1)'t/|A;;|. (Note: The 
formula for the inverse matrix using the adjoint is typi- 
cally more of theoretical than practical interest, as there 
are more efficient computational methods to determine 
the inverse, when it exists.) 


38. Repeat Exercise 37 with the matrix 


2b 3 
A=; 1 2 -2 
3 0 1 


Cross products in R”. Although it is not possible to define a 
cross product of two vectors in R” as we did for two vectors 
in R?, we can construct a “cross product” of n — 1 vectors 
in R” that behaves analogously to the three-dimensional cross 
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product. To be specific, if 


(c) ay X ++: x (a; +b) x ++: X an1 
=a XXa X- X an-1 + 


a, = (411, 412,---, Alin), a2 = (421, 422,.--, drn),-.-, 
( n) ( n) ay Ke KX DK X ani, 
an-1 = (Qn—11, Gn-12,+++5 anin) l<i<n- il, all b € R”. 
aren — | vectors in R”, we define a X a2 X +++ X an—ı to be (d) Show that if b = (b1, .. . , bn) is any vector in R”, 


the vector in R” given by the symbolic determinant then 
ei e2 Es Cn 
ayy a2 eee Ain b. (a; X a2 MR see X An—1) 
aj Xa X- X an1 = a21 a22 Eka Mn aS , 
is given by the determinant 

đn-11 an-12 Qn-1n 

l bi bet bn 

(Here èi, ..., €, are the standard basis vectors for R" .) Exer- ai = a 
. . . . n 
cises 39-42 concern this generalized notion of cross product. 
39. Calculate the following cross product in R4: 
Qn-11 Gn-1n 


(1, 2, —1, 3) x (0, 2, —3, 1) x (—5, 1, 6, 0). 
40. Use the results of Exercises 26 and 28 to show that 


41. Show that the vector b= a, Xa X =- 


X an-ı is 


(a) aX- xa XxX 


(b) a; x --- x ka; x--- 


1l<i<n-l. 


eP 


x 

Figure 1.82 The Cartesian 
coordinate system. 

y 

A 
Location y 

l Ply) 

~ Ea =X 
4 Location x 


Figure 1.83 Locating a point P, 
using Cartesian coordinates. 


x aj x 
= —(aj X---xajx- 
l<i<n-1,l<j<n-l 


++ xX aX ee: 


XxX an-ı 
= k(aı Xa K a Kee: 


orthogonal to a;,..., an—1- 


vas xX an—1 

X Aan), 42. Use the generalized notion of cross products to 
find an equation of the (four-dimensional) hyper- 
plane in R through the five points Po(1, 0,3, 0, 4), 
P\(2, —1,0,0,5), P(7,0,0, 2,0), P3(2, 0, 3,0, 4), 


X Ant), 
and P,(1, —1, 3, 0, 4). 


1.7 New Coordinate Systems 


We hope that you are comfortable with Cartesian (rectangular) coordinates for R? 
or R°. The Cartesian coordinate system will continue to be of prime importance to 
us, but from time to time, we will find it advantageous to use different coordinate 
systems. In R’, polar coordinates are useful for describing figures with circular 
symmetry. In R, there are two particularly valuable coordinate systems besides 
Cartesian coordinates: cylindrical and spherical coordinates. As we shall see, 
cylindrical and spherical coordinates are each a way of adapting polar coordinates 
in the plane for use in three dimensions. 


Cartesian and Polar Coordinates on R’ 


You can understand the Cartesian (or rectangular) coordinates (x, y) of a point 
P in R? in the following way: Imagine the entire plane filled with horizontal and 
vertical lines, as in Figure 1.82. Then the point P lies on exactly one vertical line 
and one horizontal line. The x-coordinate of P is where this vertical line intersects 
the x-axis, and the y-coordinate is where the horizontal line intersects the y-axis. 
(See Figure 1.83.) (Of course, we’ve already assigned coordinates along the axes 
so that the zero point of each axis is at the point of intersection of the axes. We 
also normally mark off the same unit distance on each axis.) Note that, because 
of this geometry, every point in R? has a uniquely determined set of Cartesian 
coordinates. 

Polar coordinates are defined by considering different geometric informa- 
tion. Now imagine the plane filled with concentric circles centered at the origin 
and rays emanating from the origin. Then every point except the origin lies on 


(Ir, 8) 


(Ar, 0) 


Figure 1.86 Locating the point 
with polar coordinates (r, 0), 


where r < 0. 


0 r = 6cos0 

0 6 
1/6 3/3 
7/4 3/2 
a/3 3 
m/2 0 
27/3 -3 
37/4 —3/2 
57/6 —3/3 

T —6 
Tx /6 -3/3 
57/4 —3/2 
47/3 -3 
37/2 0 
57/3 3 
17/4 3/2 
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Figure 1.84 The polar coordinate Figure 1.85 Locating a point P, 
system. using polar coordinates. 


exactly one such circle and one such ray. The origin itself is special: No circle 
passes through it, and all the rays begin at it. (See Figure 1.84.) For points P other 
than the origin, we assign to P the polar coordinates (r, 0), where r is the radius 
of the circle on which P lies and @ is the angle between the positive x-axis and 
the ray on which P lies. (0 is measured as opening counterclockwise.) The origin 
is an exception: It is assigned the polar coordinates (0, 0), where 0 can be any 
angle. (See Figure 1.85.) As we have described polar coordinates, r > 0 since r 
is the radius of a circle. It also makes good sense to require 0 < 0 < 27, for then 
every point in the plane, except the origin, has a uniquely determined pair of polar 
coordinates. Occasionally, however, it is useful not to restrict r to be nonnegative 
and 6 to be between 0 and 27. In such a case, no point of R? will be described by 
a unique pair of polar coordinates: If P has polar coordinates (r, 0), then it also 
has (r, 0 + 2nz) and (—r, 0 + (2n + 1)z) as coordinates, where n can be any 
integer. (To locate the point having coordinates (r, 0), where r < 0, construct the 
ray making angle 0 with respect to the positive x-axis, and instead of marching 
|r| units away from the origin along this ray, go |r| units in the opposite direction, 
as shown in Figure 1.86.) 


EXAMPLE 1 Polar coordinates may already be familiar to you. Nonetheless, 
make sure you understand that the points pictured in Figure 1.87 have the coor- 


dinates indicated. + 
(3V2, 2/4) 
(3V3, 7/6) 
(2, 57/6) (2, 2/6) 
(5, 0) (6, 0) 
d 
(-1, 57/6) or 
(1, 117/6) or 
(3, 3n/2y¢ 6) 
Figure 1.87 Figure for Figure 1.88 The graph of 
Example 1. r = 6cos@ in Example 2. 


EXAMPLE 2 Let’s graph the curve given by the polar equation r = 6 cos 0 
(Figure 1.88). We can begin to get a feeling for the graph by compiling values, as 
in the adjacent tabulation. 
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Thus, r decreases from 6 to 0 as 0 increases from 0 to 7/2; r decreases from 
0 to —6 (or is not defined, if you take r to be nonnegative) as 0 varies from x /2 
to x; r increases from —6 to 0 as 0 varies from z to 37/2; and r increases from 
0 to 6 as 6 varies from 37/2 to 27. To graph the resulting curve, imagine a radar 
screen: As 0 moves counterclockwise from 0 to 277, the point (r, 0) of the graph is 
traced as the appropriate “blip” on the radar screen. Note that the curve is actually 
traced twice: once as 0 varies from 0 to z and then again as 0 varies from z to 
27. Alternatively, the curve is traced just once if we allow only @ values that yield 
nonnegative r values. The resulting graph appears to be a circle of radius 3 (not 
centered at the origin), and, in fact, one can see (as in Example 3) that the graph 
is indeed such a circle. + 


The basic conversions between polar and Cartesian coordinates are provided 
by the following relations: 


Polar to Cartesian: | oy : (1) 
y=rsing 
; Han y 
Cartesian to polar: | a (2) 


Note that the equations in (2) do not uniquely determine r and 0 in terms of x 
and y. This is quite acceptable, really, since we do not always want to insist that 
r be nonnegative and 0 be between 0 and 27. If we do restrict r and 6, however, 
then they are given in terms of x and y by the following formulas: 


co eae 
tan™! y/x ifx>0,y>0 
tan! y/x+2m ifx>0,y <0 


tan! y/x +7 ifx<0,y>0 
x /2 ifx =0,y>0— 
37/2 ifx =0,y <0 
indeterminate tx=y =0 


The complicated formula for 6 arises because we require 0 < 0 < 2x, while the 
inverse tangent function returns values between —z/2 and 2/2 only. Now you 
see why the equations given in (2) are a better bet! 


EXAMPLE 3 We can use the formulas in (1) and (2) to prove that the curve in 
Example 2 really is a circle. The polar equation r = 6 cos @ that defines the curve 
requires a little ingenuity to convert to the corresponding Cartesian equation. The 
trick is to multiply both sides of the equation by r. Doing so, we obtain 


r? = 6rcosé. 


Now (1) and (2) immediately give 


x 


Figure 1.89 The cylindrical 
coordinate system. 


Figure 1.90 Locating a point P, 
using cylindrical coordinates. 


Figure 1.91 The graph of the 
cylindrical equation r = ro. 
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We complete the square in x to find that this equation can be rewritten as 
@œ&-—3f +y =9, 


which is indeed a circle of radius 3 with center at (3, 0). + 


Cylindrical Coordinates 


Cylindrical coordinates on R? are a “naive” way of generalizing polar coordinates 
to three dimensions, in the sense that they are nothing more than polar coordinates 
used in place of the x- and y-coordinates. (The z-coordinate is left unchanged.) 
The geometry is as follows: Except for the z-axis, fill all of space with infinitely 
extended circular cylinders with axes along the z-axis as in Figure 1.89. Then 
any point P in R? not lying on the z-axis lies on exactly one such cylinder. 
Hence, to locate such a point, it’s enough to give the radius of the cylinder, the 
circumferential angle 0 around the cylinder, and the vertical position z along the 
cylinder. The cylindrical coordinates of P are (r, 0, z), as shown in Figure 1.90. 
Algebraically, the equations in (1) and (2) can be extended to produce the basic 
conversions between Cartesian and cylindrical coordinates. 


The basic conversions between cylindrical and Cartesian coordinates are 
provided by the following relations: 


x =rcosd 
Cylindrical to Cartesian: y=rsine ; (3) 
Z= 


Ply 
Cartesian to cylindrical: toe =y. (4) 
HSR 


As with polar coordinates, if we make the restrictions r > 0, 0 < 0 < 2x, then 
all points of R? except the z-axis have a unique set of cylindrical coordinates. A 
point on the z-axis with Cartesian coordinates (0, 0, zo) has cylindrical coordinates 
(0, 8, zo), where 0 can be any angle. 


Cylindrical coordinates are useful for studying objects possessing an axis of 
symmetry. Before exploring a few examples, let’s understand the three “constant 
coordinate” surfaces. 


e The r = ro surface is, of course, just a cylinder of radius ro with axis the 
z-axis. (See Figure 1.91.) 


e The 0 = 9p surface is a vertical plane containing the z-axis (or a half-plane 
with edge the z-axis if we take r > 0 only). (See Figure 1.92.) 


e The z = Zo surface is a horizontal plane. (See Figure 1.93.) 
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Figure 1.94 The graph of 
r = 6cos@ in cylindrical 
coordinates. 


Figure 1.95 The graph of z = 2r 
in cylindrical coordinates. 


` Half-plane only 
N ifr=0 
Z=% 


Ay 


ny 


Figure 1.93 The graph of 
Figure 1.92 The graph of 0 = 6p. Z = Zo. 


EXAMPLE 4 Graph the surface having cylindrical equation r = 6 cos 0. (This 
equation is identical to the one in Example 2.) In particular, z does not appear 
in this equation. What this means is that if the surface is sliced by the horizontal 
plane z = c where c is a constant, we will see the circle shown in Example 2, no 
matter what c is. If we stack these circular sections, then the entire surface is a 
circular cylinder of radius 3 with axis parallel to the z-axis (and through the point 
(3, 0, 0) in cylindrical coordinates). This surface is shown in Figure 1.94. + 


EXAMPLE 5 Graph the surface having equation z = 2r in cylindrical co- 
ordinates. 

Here the variable 6 does not appear in the equation, which means that the 
surface in question will be circularly symmetric about the z-axis. In other words, 
if we slice the surface by any plane of the form 0 = constant (or half-plane, if we 
take r > 0), we see the same curve, namely, a line (respectively, a half-line) of 
slope 2. As we let the constant-0 plane vary, this line generates a cone, as shown 
in Figure 1.95. The cone consists only of the top half (nappe) when we restrict r 
to be nonnegative. 

The Cartesian equation of this cone is readily determined. Using the formulas 
in (4), we have 


z=2 => Y=4rP 4> z = A(x? + y’). 


Since z can be positive as well as negative, this last Cartesian equation describes 
the cone with both nappes. If we want the top nappe only, then the equation 
z = 2,/x2 + y? describes it. Similarly, z = —2,/x?2 + y? describes the bottom 
nappe. + 


Spherical Coordinates 


Fill all of space with spheres centered at the origin as in Figure 1.96. Then every 
point P € R?, except the origin, lies on a single such sphere. Roughly speaking, 
the spherical coordinates of P are given by specifying the radius p of the sphere 
containing P and the “latitude and longitude” readings of P along this sphere. 
More precisely, the spherical coordinates (p, y, 0) of P are defined as follows: p 
is the distance from P to the origin; g is the angle between the positive z-axis and 
the ray through the origin and P; and 0 is the angle between the positive x-axis 
and the ray made by dropping a perpendicular from P to the x y-plane. (See Figure 
1.97.) The 6-coordinate is exactly the same as the 0-coordinate used in cylindrical 
coordinates. (Warning: Physicists usually prefer to reverse the roles of g and 0, 
as do some graphical software packages.) 


1/4 


(1; 7/4, 0) 7 


(2, x, 1/4) or 
(2, m, 1/3) or 
(-2, 0, 0) 


Figure 1.98 Figure for Example 6. 


(2, 2/4, 2/2) 


(2, n/2, 2/2) 
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Figure 1.96 The spherical Figure 1.97 Locating the point 
coordinate system. P, using spherical coordinates. 


It is standard practice to impose the following restrictions on the range of 
values for the individual coordinates: 


p= 0, 0<9g<nZ, 0<0 <2z. (5) 


With such restrictions, all points of R?, except those on the z-axis, have a uniquely 
determined set of spherical coordinates. Points along the z-axis, except for the 
origin, have coordinates of the form (po, 0, 0) or (p0, 7, 9), where po is a positive 
constant and @ is arbitrary. The origin has spherical coordinates (0, g, 0), where 
both g and @ are arbitrary. 


EXAMPLE 6 Several points and their corresponding spherical coordinates are 
shown in Figure 1.98. + 


Figure 1.99 The graph of Figure 1.100 The spherical 
p = po (> 0). surface œ = pọ, shown for 
different values of po. 


Spherical coordinates are especially useful for describing objects that have 
a center of symmetry. With the restrictions given by the inequalities in (5), the 
constant coordinate surface o = po (po > 0) is, of course, a sphere of radius po, 
as shown in Figure 1.99. The surface given by 0 = @p is a half-plane just as in the 
cylindrical case. The y = @p surface is a single-nappe cone if gp 4 2/2 and is 
the xy-plane if pọ = 7/2 (and is the positive or negative z-axis if pọ = 0 or 7). 
(See Figure 1.100.) If we do not insist that p be nonnegative, then the cones would 
include both nappes. 
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The basic equations relating spherical coordinates to both cylindrical and 
Cartesian coordinates are as follows. 
Spherical/cylindrical: 
r=psing p=rt2 
0 =0 tmp =r . (6) 
z = p COs @ 0=90 
Spherical/Cartesian: 
x = psing cos P =x +y +z 
y = psing sind tang = yx? + y2/z. (7) 
z= p Cosy tan = y/x 


Figure 1.101 Converting Figure 1.102 Converting 
spherical to cylindrical coordinates spherical to cylindrical 
when 0 < 9 < 3. coordinates when 17/2 <y < m. 

Using basic trigonometry, it is not difficult to establish the conversions in (6). 
From the right triangle shown in Figure 1.101, we have 


(G-9)=; 
cos(~—g)=-—. 
7 p 


p 
Hence, 

TT g 

r = pcos (5 — o) = psing. 
Similarly, 
TE 

z YAT p 

so that 


_ (T 
A= psin (> -¢) = p cos Q. 

Thus, the formulas in (6) follow when 0 < g < 2/2. If m/2 < g < m, then we 

may employ Figure 1.102. So 


T : 
r= pcos (9 — 5) = psing, 
and 


` T í T 
z= —p sin (g — =) = psin (5-9) = p cos Q. 


p = 2a cos 


2a 
V3a 
J2a 
a 
0 
—a 


—/2a 
—2a 
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Hence, the relations in (6) hold in general. The equations in (7) follow by substi- 


tution of those in (6) into those of (3) and (4). 


EXAMPLE 7 The cylindrical equation z = 2r in Example 5 converts via (6) 
to the spherical equation 


pcosg = 2psing. 


Therefore, 
l E 
tng =; = g=tan as 
Thus, the equation defines a cone (as we just saw). The spherical equation is 
especially simple in that it involves just a single coordinate. + 


EXAMPLE 8 Notall spherical equations are improvements over their cylindri- 
cal or Cartesian counterparts. For example, the Cartesian equation 6x = x? + y? 
(whose polar—cylindrical equivalent is r = 6 cos 0) becomes 


6p sing cos@ = p° sin? o cos? 6 + p° sin’ ọ sin? 0 
from (7). Simplifying, 


6p sing cos = p° sin? o (cos? 0 + sin? 0) 
s 6p sing cos = p? sin’ y 
C 6cos0 = psing. 


This spherical equation is more complicated than the original Cartesian equation 
in that all three spherical coordinates are involved. Therefore, it is not at all 
obvious that the spherical equation describes a cylinder. + 


EXAMPLE 9 Let’ graph the surface with spherical equation p = 2a cos ọ, 
where a > 0. As with the graph of the cone with cylindrical equation z = 2r, 
note that the equation is independent of 0. Thus, all sections of this surface made 
by slicing with the half-plane 0 = c must be the same. If we compile values as 
in the adjacent table, then the section of the surface in the half-plane 0 = 0 is as 
shown in Figure 1.103. Since this section must be identical in all other constant- 
half-planes, we see that this surface appears to be a sphere of radius a tangent to 
the xy-plane, which is shown in Figure 1.104. 


- (2a, 0, 0) 
0=0 


Section of 
p = 2a cos 


p” 0 = 7/4 
Figure 1.103 The cross section Figure 1.104 The graph of p = 


of p =2a cos ọ in the half-plane 2a COS Q. 
0=0. 
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The Cartesian equation of the surface is determined by multiplying both sides 
of the spherical equation by p and using the conversion equations in (7): 


p =2acosy => p° = 2ap coso 
4 x +y +z =2az 


x2 y? (z ay = a 


by completing the square in z. This last equation can be recognized as that of a 
sphere of radius a with center at (0, 0, a) in Cartesian coordinates. + 


EXAMPLE 10 NASA launches a 10-ft-diameter space probe. Unfortunately, 
a meteor storm pushes the probe off course, and it is partially embedded in 
the surface of Venus, to a depth of one quarter of its diameter. To attempt to 
reprogram the probe’s on-board computer to remove it from Venus, it is necessary 
to describe the embedded portion of the probe in spherical coordinates. Let us 
find the description desired, assuming that the surface of Venus is essentially flat 
in relation to the probe and that the origin of our coordinate system is at the center 
of the probe. 


Probe 


A l eS, 
10/4' XN Surface of Venus 


a 5/2 
—_— 
Figure 1.105 The space probe of Example 10. Figure 1.106 A slice of the probe 


Figure 1.107 Coordinate view of 
the cross section of the probe of 
Example 10. 


of Example 10. 


The situation is illustrated in Figure 1.105. The buried part of the probe clearly 
has symmetry about the z-axis. That is, any slice by the half-plane 0 = constant 
looks the same as any other. Thus, 0 can vary between 0 and 277. A typical slice 
of the probe is shown in Figure 1.106. Elementary trigonometry indicates that for 
the angle œ in Figure 1.106, 

1 


cosa = : 
2 


[Nin 


Hence, œ = cos”! 5 = 7/3. Thus, the spherical angle y (which opens from the 


positive z-axis) varies from a — 2/3 = 27/3 to x as it generates the buried part 
of the probe. Finally, note that for a given value of g between 27/3 and x, p 
is bounded by the surface of Venus (the plane z = —3 in Cartesian coordinates) 
and the spherical surface of the probe (whose equation in spherical coordinates is 
p = 5). See Figure 1.107. From the formulas in (7) the equation z = —3 corre- 
sponds to the spherical equation p cosy = -3 or, equivalently, to o = -3 Sec Q. 
Therefore, the embedded part of the probe may be defined by the set 


5 2 
{(0.0.0)|-seevsn<5, 5 <e<n0<0 <2). b 


Figure 1.108 The standard basis 
vectors for the cylindrical 
coordinate system. 


Figure 1.109 The standard basis 
vectors for the spherical 
coordinate system. 
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Standard Bases for Cylindrical 
and Spherical Coordinates 


In Cartesian coordinates, there are three special unit vectors i, j, and k that point 
in the directions of increasing x-, y-, and z-coordinate, respectively. We find 
corresponding sets of vectors for cylindrical and spherical coordinates. That is, 
in each set of coordinates, we seek mutually orthogonal unit vectors that point in 
the directions of increasing coordinate values. 

In cylindrical coordinates, the situation is as shown in Figure 1.108. The 
vectors €,, €g, and e,, which form the standard basis for cylindrical coordinates, 
are unit vectors that each point in the direction in which only the coordinate 
indicated by the subscript increases. There is an important difference between 
the standard basis vectors in Cartesian and cylindrical coordinates. In the former 
case, i, j, and k do not vary from point to point. However, the vectors e, and eg 
do change as we move from point to point. 

Now we give expressions for e,, eg, and e,. Since the cylindrical z-coordinate 
is the same as the Cartesian z-coordinate, we must have e, = k. The vector 
e. must point radially outward from the z-axis with no k-component. At a 
point (x, y, z) € R? (Cartesian coordinates), the vector xi+ yj has this prop- 
erty. Normalizing it to obtain a unit vector (see Proposition 3.4 of §1.3), we 
obtain 


a= xit+ yj 
: [2 y2 


With e, and e, in hand, it’s now a simple matter to define eg, since it must be 
perpendicular to both e, and e,. We take 


—yit+ xj 
[x2 y2 


(The reason for this choice of cross product, as opposed to e, x e,, is so that eg 
points in the direction of increasing 0.) To summarize, and using the cylindrical 
to Cartesian conversions given in (3), 


€p =e, xe; = 


= cosdi-+ siné j; 


sin i + cosé j; 


In spherical coordinates, the situation is shown in Figure 1.109. In particular, 
there are three unit vectors ep, €y, and eg that form the standard basis for spherical 
coordinates. These vectors all change direction as we move from point to point. 

We give expressions for e,, €y, and eg. Since the 6-coordinates in both spher- 
ical and cylindrical coordinates mean the same thing, eg in spherical coordinates 
is given by the value of eg in (8). At a point (x, y, z), the vector e, should point 
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from the origin directly to (x, y, z). Thus, e, may be obtained by normalizing 
xi + yj + zk. Finally, e, is nothing more than eg x ep. If we explicitly perform 
the calculations just described and make use of the conversion formulas in (7), 
the following are obtained: 


i j+zk 
e, = oat eae = sing cos i + sing sindj+cos¢gk; 
x24 y2+ 22 


_ xzi+yzj -+ y’)k 
ya +y yx +y +22 


coso cos i+ cosg sind j — sin ọ kK; 


ep 


—yi+ xj 


x2 4 


sinð i+ cos 6 j. 


Although the results of (8) and (9) will not be used frequently, they will prove 
helpful on occasion. 


Hyperspherical Coordinates (optional) 


There is a way to provide a set of coordinates for R” that generalizes spherical 
coordinates on RÌ. Forn > 3, the hyperspherical coordinates of a point P € R” 
are (0, 91, 2, .. -, @n-1) and are defined by their relations with the Cartesian 
coordinates (x1, X2,...,Xn) of P as 


xı = p Sin GY] SiN G2 - «+ SIN Py_-2 COS Øn—1 
X2 = p Sin Y] SiN G2 - ++ SİN Øn—2 SIN Py} 
X3 = p SiN Qı SiN G2 - - - SIN Øn—3 COS Py—2 


(10) 


X4 = p SiN fı SİN p2 - ++ SIN Pn—4 COS Pn—3 


Xn = PCOS Q1 
To be more explicit, in equation (10) above we take 
Xk = p SiN Y1 SING? +--SiINGy_~ COS Pn-k+1 fork =3,...,n. 
Note that when n = 3, the relations in (10) become 
xı = p Sing) COS Q2 
x2: = p singi sinp . 
X3 = p COS Q1 


These relations are the same as those given in (7), so hyperspherical coordinates 
are indeed the same as spherical coordinates when n = 3. 

In analogy with (5), it is standard practice to impose the following restrictions 
on the range of values for the coordinates: 


p20, 0< gg <m fork=1,...,n— 2, 0 < n-1ı < 2m. (11) 
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Then, with these restrictions, we can convert from hyperspherical coordinates to 
Cartesian coordinates by means of the following formulas: 


= 2 2 
po =x + x5 ee x; 


tang =,/x? tes + x2 1/Xn 
tang = x] + H Xp-2/ Xn (12) 


2 2 
tan Q,-2 = xi + x3 /xX3 
tan Øn-1 = X2/X| 


Hyperspherical coordinates get their name from the fact that the (n — 1)- 
dimensional hypersurface in R” defined by the equation po = po, where pọ is a 
positive constant, consists of points on the hypersphere of radius pọ defined in 


Cartesian coordinates by the equation 


1.7 Exercises 


In Exercises 1—3, find the Cartesian coordinates of the points 
whose polar coordinates are given. 


1. (/2, 1/4) 
2. (/3, 57/6) 
3. (3, 0) 


In Exercises 4—6, give a set of polar coordinates for the point 
whose Cartesian coordinates are given. 


4. (2/3, 2) 
5. (—2, 2) 
6: (1-3) 


In Exercises 7—9, find the Cartesian coordinates of the points 
whose cylindrical coordinates are given. 


7. (2, 2,2) 
8. (m, 1/2, 1) 
9. (1, 27/3, —2) 


In Exercises 10-13, find the rectangular coordinates of the 
points whose spherical coordinates are given. 


10. (4, 2/2, 7/3) 
11. (3, 2/3, 2/2) 
12. (1, 37/4, 27/3) 
13. (2, x, 7/4) 


xp bag +i H = Po: 


In Exercises 14-16, find a set of cylindrical coordinates of the 
point whose Cartesian coordinates are given. 


14. (—1, 0, 2) 
15. (—1, V3, 13) 
16. (5, 6, 3) 


In Exercises 17 and 18, find a set of spherical coordinates of 
the point whose Cartesian coordinates are given. 


17. (1, —1, V6) 

18. (0, V3, 1) 

19. This problem concerns the surface described by the 
equation (r — 2)? + z? = 1 in cylindrical coordinates. 
(Assume r > 0.) 

(a) Sketch the intersection of this surface with the half- 
plane 0 = 7/2. 
(b) Sketch the entire surface. 
20. (a) Graph the curve in R? having polar equation r = 
2a sin @, where a is a positive constant. 
(b) Graph the surface in R? having spherical equation 
p = 2asing. 

21. Graph the surface whose spherical equation is p = 
1 — cos ọ. 

22. Graph the surface whose spherical equation is p = 
1 — sin ọ. 


In Exercises 23—25, translate the following equations from 
the given coordinate system (i.e., Cartesian, cylindrical, or 
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spherical) into equations in each of the other two systems. In 
addition, identify the surfaces so described by providing ap- 
propriate sketches. 


23. psing sind = 2 
24. z? = 2x? + 2y? 
25.r=0 


In Exercises 26-29, sketch the solid whose cylindrical coordi- 
nates (r, 0, z) satisfy the given inequalities. 

26. 0<r<3, 0<0<x/2, -l<z<2 
27.r<z<5, 
28. 2r <z<5-3r 


29. r? —1<z<5-r? 


0<0<x 


In Exercises 30-35, sketch the solid whose spherical coordi- 
nates (p, 9, 0) satisfy the given inequalities. 


30. 1<p<2 
31.0<p<l, 0<ọ<x/2 
32. 0<p<1, 0<0<x/2 


33. 0< <x/4, 0<p< 
34. 0<p<2/cosg, 0 
35. 2cosp <p <3 
36. (a) Which points P in R? have the same rectangular 
and polar coordinates? 
(b) Which points P in R? have the same rectangular 
and cylindrical coordinates? 
(c) Which points P in R? have the same rectangular 
and spherical coordinates? 
37. (a) How are the graphs of the polar equationsr = f(0) 
andr = — f (0) related? 
(b) How are the graphs of the spherical equations 
p = f(g, 9) and p = — f(g, 0) related? 
(c) Repeat part (a) for the graphs of r = f(@) and 


r=3f(@). 
(d) Repeat part (b) for the graphs of p = f(g, 0) and 
p =3f(,9). 


38. Suppose that a surface has an equation in cylindrical 
coordinates of the form z = f(r). Explain why it must 
be a surface of revolution. 


39. (a) Verify that the basis vectors e,, eg, and e; for 
cylindrical coordinates are mutually perpendicu- 
lar unit vectors. 

(b) Verify that the basis vectors ep, €y, and eg for spher- 
ical coordinates are mutually perpendicular unit 
vectors. 


40. Use the formulas in (8) to express i, j, k in terms of e,, 
€g, and ez. 


41. Use the formulas in (9) to express i, j, k in terms of e,, 
ey, and ey. 

42. Consider the solid in R? shown in Figure 1.110. 
(a) Describe the solid, using spherical coordinates. 
(b) Describe the solid, using cylindrical coordinates. 


A portion of the 
Z sphere of radius 3 
(centered at origin) 


x 


Figure 1.110 The ice-cream- 
cone-like solid in R? in Exercise 42. 


In Exercises 43—47, you will use the equations in (10) to es- 
tablish those in (12). 


43. Show that tan g,_) = x2/x1. 
44. (a) Calculate xr + x in terms of the hyperspherical 
coordinates p, 91, ..-, Qn—2- 
(b) Assuming the inequalities in (11), use part (a) to 


show that tan g,_2 = X + x2 /x3. 


45. (a) Calculate x? + x? + x} in terms of the hyperspher- 
ical coordinates p, 91, ..-, Øn-3- 


(b) Assuming the inequalities in (11), use part (a) to 


show that tan pn-3 = , |x? + xz + Ne ee 


46. (a) Fork =2,...,n—1, show that x? + x2 +++. + 
x = p? sin? gie sin? Pn-k. (Note: This is best 
accomplished by means of mathematical induc- 
tion.) 

(b) Assuming the inequalities in (11), use part (a) 
to show that, for k = 2,...,n — 1, tan n-k = 


VXT H HAR Xk. 


47. Show that x? + x2 +- + x2 = p’. 


True/False Exercises for Chapter 1 


NY OA fh 


. The displacement vector 


. Ifa = (1,7, —9) and b = (1, —9, 7), then a = b. 


. If a and b are two vectors in R? and k and / are real 


numbers, then (k —/)(a+ b) = ka — la + kb — lb. 


from Pı(1,0,—1) to 
P2(5, 3, 2) is (—4, —3, —3). 


. Force and acceleration are vector quantities. 
. Velocity and speed are vector quantities. 
. Displacement and distance are scalar quantities. 


. If a particle is at the point (2, —1) in the plane and 


moves from that point with velocity vector v = (1, 3), 
then after 2 units of time have passed, the particle will 
be at the point (5, 1). 


8. The vector (2, 3, —2) is the same as 2i + 3j — 2k. 


10. 


11. 


12. 


13. 


14. 
15. 


16. 
17. 


Miscellaneous Exercises for Chapter 1 


1. 


. A set of parametric equations for the line through 


(1, —2, 0) that is parallel to (—2, 4, 7) is x = 1 — 2t, 
y=4t-—2,7z=7. 


A set of parametric equations for the line through 


(1, 2,3) and (4,3,2) is x =4—-3t, y=3-t,z= 
t+2. 
The line with parametric equations x =2— 
3t, y=t+1, z=2t—3 has symmetric form 
x+2 1-22 3 

si a a 


The two sets of parametric equations x = 3t — 1, 
y=2-t, z=2t+5 and x=2-— 6t, y=2t+1, 
z = 7 — 4t both represent the same line. 


The parametric equations x =2sinft, y = 2cost, 
where 0 < t < x, describe a circle of radius 2. 


The dot product of two unit vectors is 1. 


For any vector a in R” and scalar k, we have ||ka|| = 
k|lall. 
Ifa, u € R” and |ju|| = 1, then proj,a = (a+ u)u. 


For any vectors a, b, ¢ in R*, we have a x (b x ¢) = 
(a x b) xc. 


If Pi, P2, ..., P, are the vertices of a regular polygon 
having n sides and if O is the center of the polygon, 


n ne š ‘ 
show that }*"_, O P; = 0. The case n = 5 is shown in 


18. 


19. 
20. 
21. 


22. 


23. 


24. 


25. 
26. 


27. 


28. 


29. 


30. 
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The volume of a parallelepiped determined by the vec- 
tors a, b, ¢ € R? is |(a x ¢)- bl. 


||a||b — ||b]|a is a vector. 
(a x b)» c — (a x ¢) -b is a scalar. 


The plane containing the points (1, 2, 1), (3, —1, 0), 
and (1, 0, 2) has equation 5x + 2y + 4z = 13. 


The plane containing the points (1, 2, 1), (3, —1, 0), 
and (1,0,2) is given by the parametric equations 
x = 2s, y = —3s —2t,z=t-s. 


If Aisa5 x 7 matrix and B isa 7 x 7 matrix, then BA 
isa 7 x 5 matrix. 


120 3 
102 1 
IfA= 592 0 , then 
0 8 0 —6 
—1 2 1 1 0 
det A = 5 2 0O}+9] -1 2 1 
0 0 -6 0 0 -6 


If A is ann x n matrix, then det(2A) = 2 det A. 


The surface having equation r = 4sin@ in cylindrical 
coordinates is a cylinder of radius 2. 


The surface having equation p = 4 cos @ in spherical 
coordinates is a sphere of radius 2. 


The surface having equation p cos 0 sing = 3 inspher- 
ical coordinates is a plane. 


The surface having equation p = 3 in spherical coor- 
dinates is the same as the surface whose equation in 
cylindrical coordinates is r? + z? = 9. 


The surface whose equation in cylindrical coordinates 
is z = 2r is the same as the surface whose equation in 
spherical coordinates is p = 77/6. 


Figure 1.111. (Hint: Don’t try using coordinates. Use 
instead sketches, geometry, and perhaps translations or 
rotations.) 
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P; 


P, P; 


Figure 1.111 The case n = 5. 


2. Find parametric equations for the line through the point 


(1, 0, —2) that is parallel to the line x = 3t + 1, y = 
5—7t,z=t+ 12. 


. Find parametric equations for the line through the 


point (1,0, —2) that intersects the line x = 3t + 1, 
y = 5 — 7t, z = t + 12 orthogonally. (Hint: Let x9 = 
3to + 1, yo = 5 — 7tọ, Zo = to + 12 be the point where 
the desired line intersects the given line.) 


. Given two points Po(a1, a2, a3) and Pı (b1, b2, b3), we 


have seen in equations (3) and (4) of §1.2 how to 
parametrize the line through Po and P; as r(t) = 


— o 
O Po + t Po Pi, where t can be any real number. (Recall 
— 
that r = OP, the position vector of an arbitrary point 
P on the line.) 
—> 
(a) For what value of t does r(t) = O Po? For what 
—> 
value of t does r(t) = O P;? 


(b) Explain how to parametrize the line segment join- 
ing Po and P}. (See Figure 1.112.) 


- 
=- 
=- 
-- 


x 


Figure 1.112 The segment joining Po 
and P is a portion of the line containing 
Po and P}. (See Exercise 4.) 


(c) Give a set of parametric equations for the line seg- 
ment joining the points (0, 1, 3) and (2, 5, —7). 


5. Recall that the perpendicular bisector of a line seg- 


ment in R? is the line through the midpoint of the seg- 
ment that is orthogonal to the segment. 


10. 


11. 


12. 


(a) Give a set of parametric equations for the perpen- 
dicular bisector of the segment joining the points 
P,(—1, 3) and P2(5, —7). 

(b) Given general points Pı(aı, a2) and P (bj, b2), 
provide a set of parametric equations for the per- 
pendicular bisector of the segment joining them. 


. If we want to consider a perpendicular bisector of a 


line segment in R?, we will find that the bisector must 

be a plane. 

(a) Give an (implicit) equation for the plane that serves 
as the perpendicular bisector of the segment join- 
ing the points Pı(6, 3, —2) and P:(—4, 1, 0). 

(b) Given general points Pı(aı, a2,a3) and 
Pz(bı, b2, b3), provide an equation for the plane 
that serves as the perpendicular bisector of the 
segment joining them. 


. Generalizing Exercises 5 and 6, we may define the per- 


pendicular bisector of a line segment in R” to be the 

hyperplane through the midpoint of the segment that 

is orthogonal to the segment. 

(a) Give an equation for the hyperplane in R> that 
serves as the perpendicular bisector of the seg- 
ment joining the points P,(1,6,0,3,—2) and 
P,(—3, —2, 4, 1, 0). 

(b) Given arbitrary points Pı(ai,..., an) and 
P(biı,..., bn) in R”, provide an equation for 
the hyperplane that serves as the perpendicular 
bisector of the segment joining them. 


. Ifa and b are unit vectors in RÌ, show that 


la x bil? + (ae by = 1. 


. (a) Ifa. b = a» c, does it follow that b = ce? Explain 


your answer. 


(b) If ax b=axe, does it follow that b=c? 
Explain. 


Show that the two lines 


hii x=t-—3, 
hb: x=4-2t, 


y=1-2t, z 
y=4 +3, z= 


are parallel, and find an equation for the plane that 
contains them. 


Consider the two planes x + y = 1 and y +z = 

These planes intersect in a straight line. 

(a) Find the (acute) angle of intersection between these 
planes. 

(b) Give a set of parametric equations for the line of 
intersection. 


Which of the following lines whose parametric equa- 
tions are given below are parallel? Are any the same? 


(a) x =4t+6,y=2-—2t,z=8t+1 


13. 


14. 


15. 


16. 


(b) x = 3 — 6t, y = 3t,z =4 — 9t 

(c) x=2-2t,y=t+4,2=-4-7 

(d) x=2t+4,y=1-—t,z=3t-2 

Determine which of the planes whose equations are 


given below are parallel and which are perpendicular. 
Are any of the planes the same? 


(a) 2x+3y-—z= 

(b) —6x +4y —2z7+2=0 
(c) x+y—-—z=2 

(d) 10x + I5y —5z=1 
(e) 3x —2y+z=1 


(a) What is the angle between the diagonal of a cube 
and one of the edges it meets? (Hint: Locate the 
cube in space in a convenient way.) 


(b) Find the angle between the diagonal of a cube and 
the diagonal of one of its faces. 

Mark each of the following statements with a 1 if you 

agree, —1 if you disagree: 

(1) Red is my favorite color. 

(2) I consider myself to be a good athlete. 

(3) I like cats more than dogs. 

(4) I enjoy spicy foods. 

(5) Mathematics is my favorite subject. 


Your responses to the preceding “questionnaire” may 
be considered to form a vector in R. Suppose that you 
and a friend calculate your respective “response vec- 
tors” for the questionnaire. Explain the significance of 
the dot product of your two vectors. 


The median of a triangle is the line segment that joins 
a vertex of a triangle to the midpoint of the opposite 
side. The purpose of this problem is to use vectors to 
show that the medians of a triangle all meet at a point. 


(a) Using Figure 1.113, write the vectors BM, and C M2 
— — 
in terms of A B and AC. 


A 


M, 


B 


Figure 1.113 Two of the three medians 
of a triangle in Exercise 16. 


17. 


18. 


19. 


20. 


21. 
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(b) Let P be the point of intersection of BM, and CM). 
— —. — — 

Write BP and CP interms of AB and AC. 


(c) Use the fact that CB = CP + PB = CA + AB to 
show that P must lie two-thirds of the way from B 
to Mı and two-thirds of the way from C to M3. 

(d) Nowuse part (c) to show why all three medians must 
meet at P. 


Suppose that the four vectors a, b, c, and d in R? are 
coplanar (i.e., that they all lie in the same plane). Show 
that then (a x b) x (c x d) = 0. 


Show that the area of the triangle, two of whose 
sides are determined by the vectors a and b (see 
Figure 1.114), is given by the formula 


1 
Area = zy lal? Ibl? — (aby. 


a 


Figure 1.114 The triangle in Exercise 18. 


Let A(1,3,—1), B(4,-1,3), C(2,5,2), and 

D(5, 1, 6) be the vertices of a parallelogram. 

(a) Find the area of the parallelogram. 

(b) Find the area of the projection of the parallelogram 
in the xy-plane. 


(a) For the line Z in R? given by the equation ax + 
by = d, find a vector v that is parallel to /. 

(b) Find a vector n that is normal to / and has first 
component equal to a. 

(c) If Po(xo, yo) is any point in R?, use vectors to de- 
rive the following formula for the distance from 


Po tol: 
|axo + byo — d| 
Distance from Pp tol = ————__. 
Ja? + b2 


To do this, you’ll find it helpful to use Figure 1.115, 
where P;(x;, y1) 1s any point on J. 

(d) Find the distance between the point (3, 5) and the 
line 8x — 5y = 2. 


(a) If Po(xo, yo, Zo) is any point in RÌ, use vectors 
to derive the following formula for the distance 
from Po to the plane TI having equation Ax + 
By + Cz=D: 

|Axo + Byo + Czo — D| 


A? + B24 C? 


Distance from Pp to II = 
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Figure 1.115 Geometric construction for 
Exercise 20. 


Figure 1.116 should help. (Pi(x1, y1, z1) is any 
point in IT.) 


Distance 


II: Ax + By+Cz=D 


x 


Figure 1.116 Geometric construction for 
Exercise 21. 


(b) Find the distance between the point (1, 5, —3) and 
the plane x — 2y + 2z + 12 = 0. 


22. (a) Let P be a pointin space that is not contained in the 


plane TI that passes through the three noncollinear 
points A, B, and C. Show that the distance between 
P and TI is given by the expression 
Ip ° (b x ©) 
llb x el 


where p = AP, b = AB, ande = AC. 


(b) Use the result of part (a) to find the distance 
between (1,0, —1) and the plane containing the 
points (1, 2, 3), (2, —3, 1), and (2, —1, 0). 


23. Let A, B, C, and D denote four distinct points in RÈ. 


(a) Show that A, B, and C are collinear if and only if 
AB x AC = 0. 
(b) Show that A, B, C, and D are coplanar if and only 


if (AB x AČ) -CD = 0. 


— 
24. Let x = OP, the position vector of a point P in R°. 


Consider the equation 
xek 1l 


Ixl = v2 


25. 


26. 


27. 


Describe the configuration of points P that satisfy the 
equation. 


Let a and b be two fixed, nonzero vectors in R3, and let 
c bea fixed constant. Explain how the pair of equations, 


aex=C 
axx=b, 
completely determines the vector x € R3. 


(a) Give examples of vectors a, b, ¢ in R? that show 
that, in general, it is not true that a x (b x €) = 
(a x b) x c. (That is, the cross product is not as- 
sociative.) 


(b) Use the Jacobi identity (see Exercise 30 of §1.4) 
to show that, for any vectors a, b, ¢ in R’, 


ax (bxc)=(axb)xe 
if and only if 
(ex a)xb=0. 


(a) Given an arbitrary (i.e., not necessarily regular) 
tetrahedron, associate to each of its four triangular 
faces a vector outwardly normal to that face with 
length equal to the area of that face. (See Fig- 
ure 1.117.) Show that the sum of these four vec- 
tors is zero. (Hint: Describe v,, ..., v4 in terms of 
some of the vectors that run along the edges of the 
tetrahedron.) 


vi 
V4 


V3 


Figure 1.117 The tetrahedron of part 
(a) of Exercise 27. 


(b) Recall that a polyhedron is a closed surface in 
R? consisting of a finite number of planar faces. 
Suppose you are given the two tetrahedra shown 
in Figure 1.118 and that face ABC of one is con- 
gruent to face A’ BC’ of the other. If you glue the 
tetrahedra together along these congruent faces, 
then the outer faces give you a six-faced polyhe- 
dron. Associate to each face of this polyhedron an 
outward-pointing normal vector with length equal 
to the area of that face. Show that the sum of these 
six vectors is zero. 


B B 
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Figure 1.118 In Exercise 27(b), glue the two tetrahedra shown along congruent faces. 


28. 


29. 


30. 


31. 


(c) Outline a proof of the following: Given an n-faced 
polyhedron, associate to each face an outward- 
pointing normal vector with length equal to the 
area of that face. Show that the sum of these n 
vectors is zero. 


Consider a right tetrahedron, that is, a tetrahedron 
that has a vertex R whose three adjacent faces are pair- 
wise perpendicular. (See Figure 1.119.) Use the result 
of Exercise 27 to show the following three-dimensional 
analogue of the Pythagorean theorem: If a, b, and 
c denote the areas of the three faces adjacent to R 
and d denotes the area of the face opposite R, then 
@ +b +e =g. 


Figure 1.119 The right tetrahedron 
of Exercise 28. The three faces 
containing the vertex R are pairwise 
perpendicular. 


(a) Use vectors to prove that the sum of the squares 
of the lengths of the diagonals of a parallelogram 
equals the sum of the squares of the lengths of the 
four sides. 


(b) Give an algebraic generalization of part (a) for R”. 


Show that for any real numbers a1, ..., an, bi, .--, Dn 
we have 
n 2 n n 
gef BE 
i=l i=l i=l 
To raise a square (n x n) matrix A to a positive integer 


power n, one calculates A” as A - A--- A (n times). 
(a) Calculate successive powers A, A?, A?, A4 of the 


f { 1 
matrix A =| 4 ck 


(b) Conjecture the general form of A” for the matrix 
A of part (a), where n is any positive integer. 

(c) Prove your conjecture in part (b) using mathemat- 
ical induction. 


32. A square matrix A is called nilpotent if A” = 0 for 
some positive power n. 


01 1 
(a) Show thatA = | 0 0O O0 | is nilpotent. 
00 0 
Q (b) Use a calculator or computer to show that A = 
00000 
10000 
0 1 0 0 O | is nilpotent. 
00410 0 
0001 0 


Q 33. Then x n matrix H, whose ijth entry is 1/(i + j — 1) 
is called the Hilbert matrix of order n. 
(a) Write out Hj, H3, H4, Hs, and He. Use a com- 
puter to calculate their determinants exactly. What 
seems to happen to det H, as n gets larger? 


(b) Now calculate Hiọ and det Hio. If you use exact 
arithmetic, you should find that det Hig 4 0 and 
hence that Ho is invertible. (See Exercises 30-38 
of §1.6 for more about invertible matrices.) 


(c) Now give a numerical approximation A for Hio. 
Calculate the inverse matrix B of this approxima- 
tion, if your computer allows. Then calculate AB 
and BA. Do you obtain the 10 x 10 identity matrix 
Iio in both cases? 


(d) Explain what parts (b) and (c) suggest about the 
difficulties in using numerical approximations in 
matrix arithmetic. 


As a child, you may have played with a popular toy called a 
Spirograph®. With it one could draw some appealing geomet- 
ric figures. The Spirograph consists of a small toothed disk with 
several holes in it and a larger ring with teeth on both inside 
and outside as shown in Figure 1.120. You can draw pictures by 
meshing the small disk with either the inside or outside circles 
of the ring and then poking a pen through one of the holes of 
the disk while turning the disk. (The large ring is held fixed.) 
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Figure 1.120 The Spirograph. Figure 1.122 Hypocycloids witha = 3, b = 2 anda = 6, b = 5. 


An idealized version of the Spirograph can be obtained 
by taking a large circle (of radius a) and letting a small circle 
(of radius b) roll either inside or outside it without slipping. 
A “Spirograph” pattern is produced by tracking a particu- 
lar point lying anywhere on (or inside) the small circle. Exer- 
cises 34—37 concern this set-up. 


34. Suppose that the small circle rolls inside the larger 
circle and that the point P we follow lies on the circum- 
ference of the small circle. If the initial configuration 
is such that P is at (a, 0), find parametric equations 
for the curve traced by P, using angle t from the posi- 
tive x-axis to the center B of the moving circle. (This 
configuration is shown in Figure 1.121.) The result- 
ing curve is called a hypocycloid. Two examples are 
shown in Figure 1.122. 


y 


Figure 1.121 The coordinate 
configuration for finding parametric 
equations for a hypocycloid. 


35. Now suppose that the small circle rolls on the outside 
of the larger circle. Derive a set of parametric equa- 
tions for the resulting curve in this case. Such a curve 
is called an epicycloid, shown in Figure 1.123. 


36. (a) A cusp (or corner) occurs on either the hypocy- 
cloid or epicycloid every time the point P on the 
small circle touches the large circle. Equivalently, 


37. 


Figure 1.123 An epicycloid with 
a=4,b=1. 


this happens whenever the smaller circle rolls 
through 27. Assuming that a/b is rational, how 
many cusps does a hypocycloid or epicycloid 
have? (Your answer should involve a and b in some 
way.) 

(b) Describe in words and pictures what happens when 
a/b is not rational. 


Consider the original Spirograph set-up again. If we 
now mark a point P at a distance c from the center 
of the smaller circle, then the curve traced by P is 
called a hypotrochoid (if the smaller circle rolls on 
the inside of the larger circle) or an epitrochoid (if 
the smaller circle rolls on the outside). Note that we 
must have b < a, but we can have c either larger or 
smaller than b. (If c < b, we get a “true” Spirograph 
pattern in the sense that the point P will be on the 
inside of the smaller circle. The situation when c > b 
is like having P mounted on the end of an elongated 
spoke on the smaller circle.) Give a set of parametric 
equations for the curves that result in this way. (See 
Figure 1.124.) 


Exercises 38—43 are made feasible through the use of appropri- 
ate software for graphing in polar, cylindrical, and spherical 


coordinates. (Note: When using software for graphing in spher- 


ical coordinates, be sure to check the definitions that are used 


for the angles y and 0.) 


Q 38. 


@ 40. 


of 


Figure 1.124 The configuration for finding 
parametric equations for epitrochoids. 


(a) Graph the curve in R? whose polar equation is 
r = cos 20. 

(b) Graph the surface in R? whose cylindrical equation 
isr = cos 20. 

(c) Graph the surface in R? whose spherical equation 
is p = cos29. 

(d) Graph the surface in R? whose spherical equation 
is p = cos 20. 


. (a) Graph the curve in R? whose polar equation is 


r=sin20. 

(b) Graph the surface in R? whose cylindrical equation 
isr = sin 28. 

(c) Graph the surface in R? whose spherical equation 
is p = sin 2ọ. 

(d) Graph the surface in R? whose spherical equation 
is p = sin 20. Compare the results of this exercise 
with those of Exercise 38. 


(a) Graph the curve in R? whose polar equation is 
r = cos 30. 

(b) Graph the surface in R? whose cylindrical equation 
isr = cos 30. 

(c) Graph the surface in R? whose spherical equation 
is p = cos 3g. 

(d) Graph the surface in R? whose spherical equation 
is p = cos 30. 


. (a) Graph the curve in R? whose polar equation is 


r = sin 30. 

(b) Graph the surface in R? whose cylindrical equation 
isr = sin 30. 

(c) Graph the surface in R? whose spherical equation 
is p = sin 3ọ. 


@ 43. 


44. 


45. 
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(d) Graph the surface in R? whose spherical equation 
is p = sin 30. Compare the results of this exercise 
with those of Exercise 40. 


. (a) Graph the curve in R? whose polar equation is 


r=1+sin z, (This curve is known as a nephroid, 
meaning “kidney shaped.”) 

(b) Graph the surface in R? whose cylindrical equation 
isr = l + sin A 

(c) Graph the surface in R? whose spherical equation 
iso = 1 + sin §. 

(d) Graph the surface in R? whose spherical equation 
isp = 1 +sin A 

(a) Graph the curve in R? whose polar equation is 
r=, 

(b) Graph the surface in R? whose cylindrical equation 
isr = 8. 

(c) Graph the surface in R? whose spherical equation 
is p = Q. 

(d) Graph the surface in R? whose spherical equation 
is 0 = 0, where 7/2 < g < wm and0 < 0 < 4x. 

Consider the solid hemisphere of radius 5 pictured in 

Figure 1.125. 

(a) Describe this solid, using spherical coordinates. 

(b) Describe this solid, using cylindrical coordinates. 


Z 


Figure 1.125 The solid hemisphere of 
Exercise 44. 
Consider the solid cylinder pictured in Figure 1.126. 


(a) Describe this solid, using cylindrical coordinates 
(position the cylinder conveniently). 


(b) Describe this solid, using spherical coordinates. 


e 6 >| 


Figure 1.126 The solid 
cylinder of Exercise 45. 
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2.1 Functions of Several Variables; 
Graphing Surfaces 


The volume and surface area of a sphere depend on its radius, the formulas 
describing their relationships being V = trr? and S = 4mr?. (Here V and S 
are, respectively, the volume and surface area of the sphere and r its radius.) 
These equations define the volume and surface area as functions of the radius. 
The essential characteristic of a function is that the so-called independent variable 
(in this case the radius) determines a unique value of the dependent variable (V 
or S). No doubt you can think of many quantities that are determined uniquely 
not by one variable (as the volume of a sphere is determined by its radius) but 
by several: the area of a rectangle, the volume of a cylinder or cone, the average 
annual rainfall in Cleveland, or the national debt. Realistic modeling of the world 
requires that we understand the concept of a function of more than one variable 
and how to find meaningful ways to visualize such functions. 


Definitions, Notation, and Examples 


A function, any function, has three features: (1) a domain set X, (2) a codomain 
set Y, and (3) a rule of assignment that associates to each element x in the 
domain X a unique element, usually denoted f(x), in the codomain Y. We will 
frequently use the notation f: X — Y for a function. Such notation indicates all 
the ingredients of a particular function, although it does not make the nature of 
the rule of assignment explicit. This notation also suggests the “mapping” nature 
of a function, indicated by Figure 2.1. 


EXAMPLE 1 Abstract definitions are necessary, but it is just as important that 
you understand functions as they actually occur. Consider the act of assigning to 
each U.S. citizen his or her social security number. This pairing defines a function: 
Each citizen is assigned one social security number. The domain is the set of U.S. 
citizens and the codomain is the set of all nine-digit strings of numbers. 

On the other hand, when a university assigns students to dormitory rooms, it 
is unlikely that it is creating a function from the set of available rooms to the set of 
students. This is because some rooms may have more than one student assigned 
to them, so that a particular room does not necessarily determine a unique student 
occupant. + 


Figure 2.2 Every y € Y is “hit” 
by at least one x € X. 


f 


Ss 


X Y 


Figure 2.3 The element b € Y 
is not the image of any x € X. 
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DEFINITION 1.1 The range of a function f:X — Y is the set of those 
elements of Y that are actual values of f. That is, the range of f consists of 
those y in Y such that y = f(x) for some x in X. 

Using set notation, we find that 


Range f = {y € Y | y= f(x) for some x € X}. 


In the social security function of Example 1, the range consists of those nine- 
digit numbers actually used as social security numbers. For example, the number 
000-00-0000 is not in the range, since no one is actually assigned this number. 


DEFINITION 1.2 A function f: X — Y is said to be onto (or surjective) 
if every element of Y is the image of some element of X, that is, if range 


Par 


The social security function is not onto, since 000-00-0000 is in the codomain 
but not in the range. Pictorially, an onto function is suggested by Figure 2.2. A 
function that is not onto looks instead like Figure 2.3. You may find it helpful to 
think of the codomain of a function f as the set of possible (or allowable) values 
of f, and the range of f as the set of actual values attained. Then an onto function 
is one whose possible and actual values are the same. 


DEFINITION 1.3 A function f:X — Y is called one-one (or injective) if 
no two distinct elements of the domain have the same image under f. That 
is, f is one-one if whenever xı, x. € X and x; # x2, then f(x1) Æ f (x2). 
(See Figure 2.4.) 


SF a. 
eee bee 


one-one not one-one 


Figure 2.4 The figure on the left depicts a one-one mapping; the one 
on the right shows a function that is not one-one. 


One would expect the social security function to be one-one, but we have heard 
of cases of two people being assigned the same number so that, alas, apparently 
it is not. 

When you studied single-variable calculus, the functions of interest were 
those whose domains and codomains were subsets of R (the real numbers). It 
was probably the case that only the rule of assignment was made explicit; it is 
generally assumed that the domain is the largest possible subset of R for which 
the function makes sense. The codomain is generally taken to be all of R. 
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EXAMPLE 2 Suppose f:R — Ris given by f(x) = x. Then the domain and 
codomain are, explicitly, all of R, but the range of f is the interval [0, oo). Thus 
f is not onto, since the codomain is strictly larger than the range. Note that f is 
not one-one, since f(2) = f(—2) = 4, but 2 4 —2. + 


EXAMPLE 3 Suppose g is a function such that g(x) = vx — 1. Then if we 
take the codomain to be all of R, the domain cannot be any larger than [1, oo). 
If the domain included any values less than one, the radicand would be negative 
and, hence, g would not be real-valued. + 


Now we’re ready to think about functions of more than one real variable. In 
the most general terms, these are the functions whose domains are subsets X of 
R” and whose codomains are subsets of R”, for some positive integers n and m. 
(For simplicity of notation, we’ll take the codomains to be all of R”, except when 
specified otherwise.) That is, such a function is a mapping f: X C R” — R” that 
associates to a vector (or point) x in X a unique vector (point) f(x) in R”. 


EXAMPLE 4 Let T:R? —> R be defined by T(x, y, z) = xy + xz + yz. We 
can think of T as a sort of “temperature function.” Given a point x = (x, y, z) in 
R?, T(x) calculates the temperature at that point. + 


EXAMPLE 5 Let L: R” — R be given by L(x) = ||x||. This is a “length func- 
tion” in that it computes the length of any vector x in R”. Note that L is not 
one-one, since L(e;) = L(e;) = 1, where e; and e; are any two of the standard 
basis vectors for R”. L also fails to be onto, since the length of a vector is always 
nonnegative. + 


EXAMPLE 6 Consider the function given by N(x) = x/||x|| where x is a vector 
in R?. Note that N is not defined if x = 0, so the largest possible domain for N is 
R? — {0}. The range of N consists of all unit vectors in R*. The function N is the 
“normalization function,” that is, the function that takes a nonzero vector in R? 
and returns the unit vector that points in the same direction. + 


EXAMPLE 7 Sometimes a function may be given numerically by a table. 
One such example is the notion of windchill—the apparent temperature one 
feels when taking into account both the actual air temperature and the speed of 
the wind. A standard table of windchill values is shown in Figure 2.5.! From it 
we see that if the air temperature is 20°F and the windspeed is 25 mph, the wind- 
chill temperature (“how cold it feels”) is 3°F. Similarly, if the air temperature is 
35°F and the windspeed is 10 mph, then the windchill is 27°F. In other words, 
if s denotes windspeed and tf air temperature, then the windchill is a function 
W(s,t). + 


The functions described in Examples 4, 5, and 7 are scalar-valued functions, 
that is, functions whose codomains are R or subsets of R. Scalar-valued functions 
are our main concern for this chapter. Nonetheless, let’s look at a few examples 
of functions whose codomains are R” where m > 1. 


1 NOAA, National Weather Service, Office of Climate, Water, and Weather Services, “NWS Wind Chill 
Temperature Index.” February 26, 2004. <http://www.nws.noaa.gov/om/windchill> (July 31, 2010). 
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Windspeed (mph) 
Air Temp 
(deg F) 5 10 15 20 25 30 35 40 45 50 55 60 
40 36 34 32 30 29 28 28 27 26 26 25 25 
35 31 2i 25 24 23 22 21 20 19 19 18 17 
30 25 21 19 17 16 15 14 13 12 12 11 10 
25 19 15 18 11 9 8 7 6 5 4 4 3 
20 13 9 6 4 3 1 0 1 2 3 3 4 
I5 7 3 0 2 4 5 7 8 9 10 11 iil 
10 1 4 7 9 11 12 14 15 16 17 18 19 
5 5 10 13 15 17 19 21 22 23 24 25 26 
0 11 16 19 22 24 26 27 29 30 31 32 33 
5 16 22 26 29 31 33 34 36 37 38 39 40 
—10 22 28 32 35 37 39 41 43 44 45 46 48 
=S 28 35 39 42 44 46 48 50 sil 52 54 33 
=20 34 41 45 48 51 53 55 57 58 60 61 62 
25 40 47 Sil 59 58 60 62 64 65 67 68 69 
—30 46 53 58 61 64 67 69 71 12 74 75 76 
=3) 52 59 64 68 71 13 76 78 19) 81 82 84 
—40 57 66 71 74 78 80 82 84 86 88 89 91 
—45 63 12 77 81 84 87 89 91 95 95 97 98 
z Figure 2.5 Table of windchill values in English units. 

EXAMPLE 8 Define f: R > R? by f(t) = (cost, sint, t). The range of f is the 

curve in R? with parametric equations x = cost, y = sint, z = t. If we think of 

t as a time parameter, then this function traces out the corkscrew curve (called a 

helix) shown in Figure 2.6. + 

EXAMPLE 9 We can think of the velocity of a fluid as a vector in RÌ. This 

vector depends on (at least) the point at which one measures the velocity and also 

y the time at which one makes the measurement. In other words, velocity may be 


x 


Figure 2.6 The helix of 
Example 8. The arrow shows the 
direction of increasing t. 


Figure 2.7 A water 
pitcher. The velocity v of 
the water is a function 
from a subset of Rf to R°. 


considered to be a function v: X C R > R?. The domain X is a subset of R* 
because three variables x, y, z are required to describe a point in the fluid and a 
fourth variable t is needed to keep track of time. (See Figure 2.7.) For instance, 
such a function v might be given by the expression 


V(x, y, z, t) = xyztit (x7 — y*)j + Bz + 0k. + 


You may have noted that the expression for v in Example 9 is considerably 
more complicated than those for the functions given in Examples 4—8. This is 
because all the variables and vector components have been written out explicitly. 
In general, if we have a function f: X C R” —> R”, then x € X can be written as 
x = (X1, X2, . . - , Xn) and f can be written in terms of its component functions 
fis f2, ---, fm. The component functions are scalar-valued functions of x € X 
that define the components of the vector f(x) € R”. What results is a morass of 
symbols: 


f(x) = f(%1, X2, .--, Xn) (emphasizing the variables) 
= (fi(x), fo(x), .--, fn) (emphasizing the component functions) 


= (fii, X2 -+5 Xn), PAX X25 -+3 Xn) oo +s JmX; X2; - -+3 Xn)) 


(writing out all components). 
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For example, the function L of Example 5, when expanded, becomes 


L(x) = LG, x2...) Xn) = yf tad tee tad. 


The function N of Example 6 becomes 


N(x) = x (1, X2, x3) 
Ixl x24 x2 4.2 
x} X2 X3 


(ordre TE Jotta 
and, hence, the three component functions of N are 


Xi X2 


B N2(x1, X2, X3) = B 
fx? +x +x? f/x? + x3 + x3 


N3(x1, x2, x3) = 


Nin ta) = 


X3 


x? + x2 +x? 


Although writing a function in terms of all its variables and components has 
the advantage of being explicit, quite a lot of paper and ink are used in the process. 
The use of vector notation not only saves space and trees but also helps to make 
the meaning of a function clear by emphasizing that a function maps points in 
R” to points in R”. Vector notation makes a function of 300 variables look “just 
like” a function of one variable. Try to avoid writing out components as much as 
you can (except when you want to impress your friends). 


Visualizing Functions 


No doubt you have been graphing scalar-valued functions of one variable for so 
long that you give the matter little thought. Let’s scrutinize what you’ve been do- 
ing, however. A function f: X C R —> R takes a real number and returns another 
real number as suggested by Figure 2.8. The graph of f is something that “lives” 
in R?. (See Figure 2.9.) It consists of points (x, y) such that y = f(x). That is, 


Graph f = {(x, f(x)) |x € X} ={(@, y) | x € X, y = fœ). 


The important fact is that, in general, the graph of a scalar-valued function 
of a single variable is a curve—a one-dimensional object—sitting inside two- 
dimensional space. 


y 
ih ae -a 
x pa f(x) es a rr 
——3—_R ——*—R x 
xX 


Figure 2.8 A function f: X C R >R. Figure 2.9 The graph of f. 
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Now suppose we have a function f: X C R? — R, that is, a function of two 
variables. We make essentially the same definition for the graph: 


Graph f = {(x, f(x)) | x € X}. 


Of course, x = (x, y) is a point of R?. Thus, {(x, f(x))} may also be written as 


(x, y, f(x,y}, oras {(,y,z)|@,y)eX,z= fi, y). 


Hence, the graph of a scalar-valued function of two variables is something that 
sits in R?. Generally speaking, the graph will be a surface. 


EXAMPLE 10 The graph of the function 


1 1 7 
:R? —> R, y) = 5y Š 
f:R' > Prea = Ta 
is shown in Figure 2.10. For each point x = (x, y) in R?, the point in R? with 
coordinates (x, y, b y -y 4x? i) is graphed. + 
f(x,y) 


ain Wie xy-plane) 


Figure 2.10 The graph of f(x, y) = 5y? — y — fx? + 4. 


x 


Graphing functions of two variables is a much more difficult task than graph- 
ing functions of one variable. Of course, one method is to let a computer do 
the work. Nonetheless, if you want to get a feeling for functions of more than 
one variable, being able to sketch a rough graph by hand is still a valuable skill. 
The trick to putting together a reasonable graph is to find a way to cut down on 
the dimensions involved. One way this can be achieved is by drawing certain 
special curves that lie on the surface z = f(x, y). These special curves, called 
contour curves, are the ones obtained by intersecting the surface with horizontal 
planes z = c for various values of the constant c. Some contour curves drawn 
on the surface of Example 10 are shown in Figure 2.11. If we compress all the 
contour curves onto the xy-plane (in essence, if we look down along the posi- 
tive z-axis), then we create a “topographic map” of the surface that is shown in 
Figure 2.12. These curves in the xy-plane are called the level curves ofthe original 
function f. 

The point of the preceding discussion is that we can reverse the process in 
order to sketch systematically the graph of a function f of two variables: We 
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Figure 2.11 Some contour curves of the Figure 2.12 Some level curves of 
function in Example 10. the function in Example 10. 


first construct a topographic map in R? by finding the level curves of f, then 
situate these curves in R as contour curves at the appropriate heights, and finally 
complete the graph of the function. Before we give an example, let’s restate our 
terminology with greater precision. 


DEFINITION 1.4 Let f: X C R? > R be a scalar-valued function of two 
variables. The level curve at height c of f is the curve in R? defined by the 
equation f(x, y) = c, where c is a constant. In mathematical notation, 


Level curve at height c = {(x, y) € Rely Gay) — ce 


The contour curve at height c of f is the curve in R? defined by the two 
equations z = f(x, y) and z = c. Symbolized, 


Contour curve at height c = {(, y JER |z= f(x, y= c} : 


In addition to level and contour curves, consideration of the sections of a 
surface by the planes where x or y is held constant is also helpful. A section of a 
surface by a plane is just the intersection of the surface with that plane. Formally, 
we have the following definition: 


DEFINITION 1.5 Let f: X C R? > R be a scalar-valued function of two 
variables. The section of the graph of f by the plane x = c (where c 
is a constant) is the set of points (x, y, z), where z = f(x, y) and x =c. 
Symbolized, 

Section by x = c is Kes y,z) € R? |z = f(x, y), x = e)k 
Similarly, the section of the graph of f by the plane y = c is the set of 
points described as follows: 


Section by y =c is {(x, y,z) €R’ |z=fG, y), y =c}. 
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EXAMPLE 11 We’ll use level and contour curves to construct the graph of the 
function 
fF:R >R, f@ sy =4—2? -y 
By Definition 1.4, the level curve at height c is 
{(x, y)€ R?|4-—x°-y? =c} ={(x,y)|x?+y? =4-c}. 


Thus, we see that the level curves for c < 4 are circles centered at the origin of 
radius v4 — c. The level “curve” at height c = 4 is not a curve at all but just a 
single point (the origin). Finally, there are no level curves at heights larger than 4 
since the equation x? + y? = 4 — c has no real solutions in x and y. (Why not?) 
These remarks are summarized in the following table: 


c Level curve x? + y? =4-— c 
=5 x+y =9 
= x+y =5 

(0) x+y = 4 

1 x+y =3 

3 yry =] 

4 x+y =0 4> x=y=0 

c, where c > 4 empty 


Thus, the family of level curves, the “topographic map” of the surface z = 
4 — x? — y?, is shown in Figure 2.13. Some contour curves, which sit in R3, 
are shown in Figure 2.14, where we can get a feeling for the complete graph of 
z = 4 — x? — y?. It is a surface that looks like an inverted dish and is called a 
paraboloid. (See Figure 2.15.) To make the picture clearer, we have also sketched 
in the sections of the surface by the planes x = 0 and y = 0. The section by x = 0 
is given analytically by the set 


[œ y z ER |z=4- x- y’, x= 0} = [(,y,2)|z=4-y’}. 
Similarly, the section by y = 0 is 
(x,y,z) ER |z=4-x — y, y=0} = {@,0,2)|z=4-x°}. 


Figure 2.13 The topographic 
map ofz = 4 — x? — y? (i.e., 
several of its level curves). 


c=-5 x 
Figure 2.14 Some contour Figure 2.15 The graph of 
curves of z = 4— x? — y?. f(x, y)=4- x- y. 
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Figure 2.16 Some level curves 
of g(x, y) = y? — x’. 


Figure 2.17 The contour curves 
and graph of g(x, y) = y? — x. 


Since these sections are parabolas, it is easy to see how this surface obtained its 
name. , 


EXAMPLE 12 We’ll graph the function g: R? —> R, g(x, y) = y? — x”. The 
level curves are all hyperbolas, with the exception of the level curve at height 0, 
which is a pair of intersecting lines. 


la Level curve y? — x? = c 
—4 xy =4 
—1 x? — y =1 
0 | y -x= => aH) => ys 
y- =l 
4 y- x =4 


The collection of level curves is graphed in Figure 2.16. The sections by x = c 
are 


rols =r aeaa y E 
These are clearly parabolas in the planes x = c. The sections by y = c are 
(œ, y, z) lz =y -x,y =c} = {(c, y, z) |z = ° — x°}, 


which are again parabolas. The level curves and sections generate the contour 
curves and surface depicted in Figure 2.17. Perhaps understandably, this surface 
is called a hyperbolic paraboloid. + 


EXAMPLE 13 We compare the graphs of the function f(x, y) = 4 — x? — y? 
of Example 11 with that of 
h:R? —{(0,0)} > R, = A(x, y) = ln(x? + y’). 
The level curve of h at height c is 
[æy ER |m? + y?) =c} = fa, y) xy =e}. 


Since e° > 0 for all c € R, we see that the level curve exists for any c and is a 
circle of radius vet = e°’. 


c | Level curve x? + y? = e° 
—5 x+y =e 
= Gaye 

0 w pya 

1 x+y =e 

3 rya 

4 Vye 


The collection of level curves is shown in Figure 2.18 and the graph in Figure 2.19. 
Note that the section of the graph by x = 0 is 


{@, y, z) € R? | z=In(x* +y’, x =0} = {(0, y, 2) | z =n?) = 21n|y]}. 


Figure 2.19 The graph of 
z = ln(x? + y’), shown with 
sections by x = 0 and y = 0. 
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y 


Figure 2.18 The collection of level curves of z = In (x? + y’). 


The section by y = 0 is entirely similar: 


{(x, yz) € R? | z=In(@x? + y’), y= 0} = {x 0, z) | z= n(x”) = 21n |x|}. 
+ 


In fact, if we switch from Cartesian to cylindrical coordinates, it is quite 
easy to understand the surfaces in both Examples 11 and 13. In view of the 
Cartesian/cylindrical relation x? + y? = r?, we see that for the function f of 
Example 11, 


z=4-=x -y =4- R? +y) =4-7P. 
For the function h of Example 13, we have 
z = lnx? + y?) = ln (rô) = 2hr, 


where we assume the usual convention that the cylindrical coordinate r is non- 
negative. Thus both of the graphs in Figures 2.15 and 2.19 are of surfaces of 
revolution obtained by revolving different curves about the z-axis. As a result, 
the level curves are, in general, circular. 


The preceding discussion has been devoted entirely to graphing scalar-valued 
functions of just two variables. However, all the ideas can be extended to more 
variables and higher dimensions. If f: X C R” — R is a (scalar-valued) function 
of n variables, then the graph of f is the subset of R”+! given by 


Graph f = {(x, f(x)) | x€ X} 


= {(x1, vee Xny Xn+1) | (x1, . seyn) € X, 


Xn+1 = f(%1,---,%n)}- 
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Figure 2.20 The level sets of the 
function F(x, y,z)=x+y+z 
are planes in R°. 


Figure 2.21 The unit circle 
x+y =l. 


x 


Figure 2.22 The sphere of radius 
a, centered at (xo, yo, Zo). 


(1,0) 


y 


(The compactness of vector notation makes the definition of the graph ofa function 
ofn variables exactly the same as in (1).) The level set at height c of sucha function 
is defined by 
Level set at height c = {x € R” | f(x) =c} 
=A (Ki Xe cess den) | F CE 5.05 Xa) Se} 
While the graph of f is a subset of R”+!, a level set of f is a subset of R”. This 


makes it possible to get some geometric insight into graphs of functions of three 
variables, even though we cannot actually visualize them. 


EXAMPLE 14 Let F: R? — R be given by F(x, y, z) =x + y + z. Then the 
graph of F is the set {(x, y, z, w) | w = x + y + z} and is a subset (called a 
hypersurface) of R*, which we cannot depict adequately. Nonetheless, we can 
look at the level sets of F, which are surfaces in R°. (See Figure 2.20.) We have 


Level set at height c = {(x, y,z) |x +y+z=c}. 
Thus, the level sets form a family of parallel planes with normal vector i+ j + k. 
+ 


Surfaces in General 


Not all curves in R? can be described as the graph of a single function of one 
variable. Perhaps the most familiar example is the unit circle shown in Figure 2.21. 
Its graph cannot be determined by a single equation of the form y = f(x) (or, for 
that matter, by one of the form x = g(y)). As we know, the graph of the circle may 
be described analytically by the equation x? + y? = 1. In general, a curve in R? 
is determined by an arbitrary equation in x and y, not necessarily one that isolates 
y alone on one side. In other words, this means that a general curve is given by an 
equation of the form F(x, y) = c (i.e., a level set of a function of two variables). 
The analogous situation occurs with surfaces in R°. Frequently a surface is 
determined by an equation of the form F(x, y, z) =c (i.e., as a level set of a 
function of three variables), not necessarily one of the form z = f(x, y). 


EXAMPLE 15 Asphereisa surface in R? whose points are all equidistant from 
a fixed point. If this fixed point is the origin, then the equation for the sphere is 
lx — O|] = |x|] = a, (3) 


where a is a positive constant and x = (x, y, z) is a point on the sphere. If we 
square both sides of equation (3) and expand the (implicit) dot product, then we 
obtain perhaps the familiar equation of a sphere of radius a centered at the origin: 


Vtr t aa’. (4) 


If the center of the sphere is at the point xo = (xo, yo, Zo), rather than the origin, 
then equation (3) should be modified to 


Ix — Xo] = a. (5) 


(See Figure 2.22.) 
When equation (5) is expanded, the following general equation for a sphere 
is obtained: 


2 


(x — xo) +(y — wr +E- zo) =a 
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In the equation for a sphere, there is no way to solve for z uniquely in terms 
of x and y. Indeed, if we try to isolate z in equation (4), then 


Z=- -y 


so we are forced to make a choice of positive or negative square roots in order to 
solve for z: 
z=Vae—-x*-y or z=-Va 


The positive square root corresponds to the upper hemisphere and the negative 
square root to the lower one. In any case, the entire sphere cannot be the graph 
of a single function of two variables. + 


Of course, the graph of a function of two variables does describe a surface in 
the “level set” sense. If a surface happens to be given by an equation of the form 
z= f(x, y) 


for some appropriate function f: X C R? — R, then we can move z to the oppo- 
site side, obtaining 


fœ, y)-z=0. 


If we define a new function F of three variables by 


F(x, y,z) = f(x, y)—z, 


then the graph of f is precisely the level set at height 0 of F. We reiterate this 
point since it is all too often forgotten: The graph of a function of two variables is 
a surface in R? and is a level set of a function of three variables. However, not all 
level sets of functions of three variables are graphs of functions of two variables. 
We urge you to understand this distinction. 


Quadric Surfaces 


Conic sections, those curves obtained from the intersection of a cone with various 
planes, are among the simplest, yet also the most interesting, of plane curves: 
They are the circle, the ellipse, the parabola, and the hyperbola. Besides being 
produced in a similar geometric manner, conic sections have an elegant algebraic 
connection: Every conic section is described analytically by a polynomial equation 
of degree two in two variables. That is, every conic can be described by an equation 
that looks like 


Ax? + Bxy + Cy? + Dx + Ey+F=0 


for suitable constants A,..., F. 

In R?, the analytic analogue of the conic section is called a quadric surface. 
Quadric surfaces are those defined by equations that are polynomials of degree 
two in three variables: 


Ax? + Bxy + Cxz+ Dy? + Eyz + Fz? + Gx + Hy+Iz+J=0. 


To pass from this equation to the appropriate graph is, in general, a cumbersome 
process without the aid of either a computer or more linear algebra than we 
currently have at our disposal. So, instead, we offer examples of those quadric 
surfaces whose axes of symmetry lie along the coordinate axes in R? and whose 
corresponding analytic equations are relatively simple. In the discussion that 
follows, a, b, and c are constants, which, for convenience, we take to be positive. 
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z = constant 


Plane section 
is an ellipse 


x x 

Figure 2.23 The ellipsoid Figure 2.24 The elliptic paraboloid Figure 2.25 The hyperbolic 
2L y 2? n y? oaz y g 

z2 prank . po Rp paraboloid = = 75 — a: 


Ellipsoid (Figure 2.23.) x?/a* + y?/b? + 27/c? =1. 

This is the three-dimensional analogue of an ellipse in the plane. The sections 
of the ellipsoid by planes perpendicular to the coordinate axes are all ellipses. 
For example, if the ellipsoid is intersected with the plane z = 0, one obtains the 
standard ellipse x?/a* + y*/b* = 1, z = 0. Ifa = b = c, then the ellipsoid is a 
sphere of radius a. 


Elliptic paraboloid (Figure 2.24.) z/c = x?/a? + y?/b?. 

(The roles of x, y, and z may be interchanged.) This surface is the graph of 
a function of x and y. The paraboloid has elliptical (or single-point or empty) 
sections by the planes “z = constant” and parabolic sections by “x = constant” or 


Figure 2.26 The elliptic “y = constant” planes. The constants a and b affect the aspect ratio of the elliptical 
cone a E x? y cross sections, and the constant c affects the steepness of the dish. (Larger values 
e at b of c produce steeper paraboloids.) 
z Hyperbolic paraboloid (Figure 2.25.) z/c = y?/b? — x? /a?. 


(Again the roles of x, y, and z may be interchanged.) We saw the graph of this 
surface earlier in Example 12 ofthis section. It is shaped like a saddle whose “x = 
constant” or “y = constant” sections are parabolas and “z = constant” sections 
are hyperbolas. 


Elliptic cone (Figure 2.26.) z?/c? = x? /a? + y*/b’. 
The sections by “z = constant” planes are ellipses. The sections by x = 0 or 
y = 0 are each a pair of intersecting lines. 


Hyperboloid of one sheet (Figure 2.27.) x?/a? + y?/b? — z?/2 = 1. 
The term “one sheet” signifies that the surface is connected (i.e., that you can 
travel between any two points on the surface without having to leave the surface). 
The sections by “z = constant” planes are ellipses and those by “x = constant” 
or “y = constant” planes are hyperbolas, hence, this surface’s name. 


Figure 2.27 The graph of the Hyperboloid of two sheets (Figure 2.28.) 2?/c? — x?/a? — y? /b? = 1. 
: y 2 i The fact that the left-hand side of the defining equation is the opposite of the left 
a a us p 22` lisa side of the equation for the previous hyperboloid is what causes this surface to 


hyperboloid of one sheet. consist of two pieces instead of one. More precisely, consider the sections of the 


Figure 2.28 The graph of 
the equation 


Figure 2.29 The 


hyperboloids 


2 2 2 
z x y 
An at ge 


asymptotic to the cone 


2.1 Exercises 


1. Let f:R — R be given by f(x) = 2x? + 1. 
(a) Find the domain and range of f. 


(b) Is f one-one? 
(c) Is f onto? 
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surface by planes of the form z = k for different constants k. These sections are 
thus given by 


or, equivalently, by 


If —c < k < c, then 0 < k?/c? < 1. Thus, k?/c? — 1 < 0, and so the preceding 
equation has no solution in x and y. Hence, the section by z = k, where |k| < c, 
is empty. If |k| > c, then the section is an ellipse. The sections by “x = constant” 
or “y = constant” planes are hyperbolas. 


In the same way that the hyperbolas 


x y 
an +1 
are asymptotic to the lines y = +(b/a)x, the hyperboloids 
2 2 2 
Zz x y 
eee 


are asymptotic to the cone 


This is perhaps intuitively clear from Figure 2.29, but let’s see how to prove it 
rigorously. In our present context, to say that the hyperboloids are asymptotic 
to the cones means that they look more and more like the cones as |z| becomes 
(arbitrarily) large. Analytically, this should mean that the equations for the hy- 
perboloids should approximate the equation for the cone for sufficiently large |z]. 
The equations of the hyperboloids can be written as follows: 
2 y 2 z2 a 

As |z| — 00, c?/z? — 0, so the right side of the equation for the hyperboloids 
approaches z?/c?. Hence, the equations for the hyperboloids approximate that of 
the cone, as desired. 


(c) Find a way to restrict the codomain to make a new 
function with the same rule of assignment as g that 
is onto. 


Find the domain and range of each of the functions given in 
Exercises 3—7. 


2. Let g: R? — R be given by g(x, y) = 2x? +3y? —7. 
(a) Find the domain and range of g. 


(b) Find a way to restrict the domain to make a new 
function with the same rule of assignment as g that 
is one-one. 


x 
3. f(x,y) =~ 
y 


4. f(x, y)= ln(x + y) 
5. g(x, y, z) = V2 +(y—2P +4 1 
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1 
e g(x, y,2= (=e 
7. f(x, y)= (x+y. +9) 
8. Let f:R? > R? be defined by f(x, y)= (x +y, 


ye*, xy +7). Determine the component functions 
of f. 


1 
y-1 


9. Determine the component functions of the function v 
in Example 9. 


10. Let f: R? — R? be defined by f(x) = x + 3j. Write out 
the component functions of f in terms of the compo- 
nents of the vector x. 


11. Consider the mapping that assigns to a nonzero vector 
x in R° the vector of length 2 that points in the direction 
opposite to x. 

(a) Give an analytic (symbolic) description of this 
mapping. 


17. fœ, y) = Vx +y 
18. f(x, y) = 4x? + 9y? 
19. f(x, y) = xy 


20. f(x, y)= 


21. f(x, y)= 


<Sl[e al 


22. f(x, y)=3-—2x-—y 
23. f(x,y) = |x| 


In Exercises 24-27, use a computer to provide a portrait of 
the given function g(x, y). To do this, (a) use the computer to 
help you understand some of the level curves of the function, 
and (b) use the computer to graph (a portion of) the surface 
z = g(x, y). In addition, mark on your surface some of the 
contour curves corresponding to the level curves you obtained 
in part (a). (See Figures 2.10 and 2.11.) 


Pp 24. g(x, y) = ye" 


Q 25. 


Ó 2. 56.) = 62 ayer? 


(b) If x = (x, y, z), determine the component func- 


tions of this mapping. g(x, y) =x? —xy 


12. Consider the function f: R? —> R? given by f(x) = Ax, 


2 -1 f ; 
where A = 5 0 | and the vector x in R? is Q 27. g(x, y) = sin(2 — x“ — y*) 
-6 3 x? +y +1 


written as the 2 x 1 column matrix x = | jia | 
2 


(a) Explicitly determine the component functions of 
f in terms of the components x1, x2 of the vector 
(i.e., column matrix) x. 


(b) Describe the range of f. 
13. Consider the function f: Rt > R? given by f(x) = Ax, 


20 -1 1 
where A=] 0 3 0 0 | and the vector x in R* 
20 -1 1 
Xx] 
is written as the 4 x 1 column matrix x = a 
3 
X4 


(a) Determine the component functions of f in terms 
of the components x1, x2, X3, x4 of the vector (i.e., 
column matrix) x. 


(b) Describe the range of f. 


In each of Exercises 14—23, (a) determine several level curves 
of the given function f (make sure to indicate the height c of 
each curve); (b) use the information obtained in part (a) to 
sketch the graph of f. 


14. f(x,y) =3 
15. f(x, y) =x? +y? 
16. f(x,y) =x? +y —9 


28. The ideal gas law is the equation PV = kT, where P 
denotes the pressure of the gas, V the volume, T the 
temperature, and k is a positive constant. 

(a) Describe the temperature T ofthe gas as a function 
of volume and pressure. Sketch some level curves 
for this function. 


(b) Describe the volume V of the gas as a function 
of pressure and temperature. Sketch some level 
curves. 


29. (a) Graph the surfaces z = x? and z = y’. 


(b) Explain how one can understand the graph of the 
surfaces z = f(x) and z = f(y) by considering 
the curve in the uv-plane given by v = f(u). 


(c) Graph the surface in R? with equation y = x?. 


To 30. Use a computer to graph the family of level curves for 


the functions in Exercises 20 and 21 and compare your 
results with those obtained by hand sketching. How do 
you account for any differences? 


31. Givena function f(x, y), can two different level curves 
of f intersect? Why or why not? 


In Exercises 32—36, describe the graph of g(x, y, z) by com- 


puting some level surfaces. (If you prefer, use a computer to 


assist you.) 
32. g(x, y,z) =x —2y4+3z 
33. g(x, y, zZ) =x? +y? -z 


34. g(x,y, z) =x? +y +z’ 
35. g(x, y, Z) = x? + 9y? + 42? 
36. g(x, y, z) = xy — yz 


37. (a) Describe the graph of g(x, y, z) = x? + y? by 
computing some level surfaces. 
(b) Suppose g is a function such that the expres- 
sion for g(x, y,z) involves only x and y (ie., 
g(x, y, z) = h(x, y)). What can you say about the 
level surfaces of g? 


(c) Suppose g is a function such that the expression 
for g(x, y, z) involves only x and z. What can you 
say about the level surfaces of g? 


(d) Suppose g is a function such that the expression 
for g(x, y, z) involves only x. What can you say 
about the level surfaces of g? 


38. This problem concerns the surface determined by the 
graph of the equation x? + xy — xz = 2. 
(a) Find a function F(x, y,z) of three variables so 
that this surface may be considered to be a level 
set of F. 


(b) Find a function f(x, y) of two variables so that 
this surface may be considered to be the graph of 


z= f(x, y). 
39. Graph the ellipsoid 
ey, 
—+— =l, 
7 F 9 FZ 


Is it possible to find a function f(x, y) so that this ellip- 
soid may be considered to be the graph of z = f(x, y)? 
Explain. 


Sketch or describe the surfaces in R? determined by the equa- 
tions in Exercises 40-46. 
2 2 
x x 
40. z=- y? ee 
4 y 41. z 7 y 
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2 2 3, 2 


y z y z 
42.x=—-——~ a 
X 7 9 Ben ae ag 16 0 
x2 y? 2 
‘a, =f eI 
4 1679 
x2 y? 
45. —4+—=27-1 
5 16 
46. z= y?+2 


We can look at examples of quadric surfaces with centers 
or vertices at points other than the origin by employing a 
change of coordinates of the form x =x — xo, Y=y— Yo, 
and Z = z — Zo. This coordinate change simply puts the point 
(xo, Yo, Zo) Of the xyz-coordinate system at the origin of the 
X yZ-coordinate system by a translation of axes. Then, for ex- 
ample, the surface having equation 


@-1P QO+2° 
4 9 


+(z-5ř=1 


can be identified by setting x = x — 1, ý = y + 2, and Z = 
z— 5, so that we obtain 


=i 


which is readily seen to be an ellipsoid centered at (1, —2, 5) 
of the xyz-coordinate system. By completing the square in x, 
y, or z as necessary, identify and sketch the quadric surfaces 
in Exercises 47—52. 


47. x- 1} +O +1 =(z+3F 

48. z = 4x? + (y +2) 

49. 4x? +y? +z? +8x=0 

50. 4x? + y? — 4z + 8x — 4y +4=0 
51. x? +2y? — 6x -—z+10=0 

52. 9x? + 4y? — 3627 — 8y — 144z = 104 


As you may recall, limit processes are central to the development of calculus. The 
mathematical and philosophical debate in the 18th and 19th centuries surrounding 
the meaning and soundness of techniques of taking limits was intense, questioning 
the very foundations of calculus. By the middle of the 19th century, the infamous 
“e — ô” definition of limits had been devised, chiefly by Karl Weierstrass and 
Augustin Cauchy, much to the chagrin of many 20th (and 21st) century students of 
calculus. In the ensuing discussion, we study both the intuitive and rigorous mean- 
ings of the limit of a function f: X C R” — R” and how limits lead to the notion 
of a continuous function, our main object of study for the remainder of this text. 
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Figure 2.30 The graph of f of 
Example 1. 


The Notion of a Limit 
For a scalar-valued function of a single variable, f: X C R — R, you have seen 
the statement 
lim f(x) = L 

and perhaps have an intuitive understanding of its meaning. In imprecise terms, 
the preceding equation (read “The limit of f(x) as x approaches a is L.”) means 
that you can make the numerical value of f(x) arbitrarily close to L by keeping 
x sufficiently close (but not equal) to a. This idea generalizes immediately to 
functions f: X C R” — R”. In particular, by writing the equation 


lim f(x) = L, 


where f: X C R” — R”, we mean that we can make the vector f(x) arbitrarily 
close to the limit vector L by keeping the vector x € X sufficiently close (but not 
equal) to a. 

The word “close” means that the distance (in the sense of $1.6) between f(x) 
and L is small. Thus, we offer a first definition of limit using the notation for 
distance. 


DEFINITION 2.1 (INTUITIVE DEFINITION OF LIMIT) The equation 
lim f(x) = L, 
x>a 


where f: X C R” —> R”, means that we can make ||f(x) — L|| arbitrarily 
small (i.e., near zero) by keeping ||x — all sufficiently small (but nonzero). 


In the case of a scalar-valued function f: X C R” — R, the vector length 
||f(x) — L]| can be replaced by the absolute value | f(x) — L|. Similarly, if f is a 
function of just one variable, then ||x — al] can be replaced by |x — a|. 


EXAMPLE 1 Suppose that f:R — R is given by 


0 ifx<l 


f=, ifx>1` 


The graph of f is shown in Figure 2.30. What should lim,_,; f(x) be? The limit 
can’t be 0, because no matter how near we make x to | (i.e., no matter how small 
we take |x — 1|), the values of x can be both slightly larger and slightly smaller 
than 1. The values of f corresponding to those values of x larger than 1 will 
be 2. Thus, for such values of x, we cannot make | f(x) — 0| arbitrarily small, 
since, for x > 1, | f(x) — 0| = |2 — 0| = 2. Similarly, the limit can’t be 2, since 
no matter how small we take |x — 1|, x can be slightly smaller than 1. For x < 1, 
f(x) = 0 and, therefore, we cannot make | f(x) — 2| = |0 — 2| = 2 arbitrarily 
small. Indeed, it should now be clear that the limit can’t be L for any L e R. 
Hence, lim,_,; f(x) does not exist for this function. + 


EXAMPLE 2 Let f: R? —> R? be defined by f(x) = 5x. (That is, f is five times 
the identity function.) Then it should be obvious intuitively that 


lim f(x) = lim 5x = Si+ Sj. 
x>HJ 


x—>i+j 


2.2 | Limits 99 


Indeed, if we write x = xi + yj, then 
Ifœ — (Si + Spl] = II(Sxi + 5yj) — (Si + 5) 
= |5¢œ — Di +50 — Djl = vV25¢ — 1} +250 — 1}? 
= 5/(@ — 1} +0- 1. 
This last quantity can be made as small as we wish by keeping 
Ix- G+) =V@-1P +0 -1P 
sufficiently small. + 


EXAMPLE 3 Now suppose that g: R” —> R” is defined by g(x)=3x. We 
claim that, for any a € R”, 


lim 3x = 3a. 


In other words, we claim that ||3x — 3a|| can be made as small as we like by 
keeping ||x — al] sufficiently small. Note that 


||3x — 3al| = ||3(« — all = 3]|x — all. 


This means that if we wish to make ||3x — 3a|| no more than, say, 0.003, then 
we may do so by making sure that ||x — a|| is no more than 0.001. If, instead, 
we want ||3x — 3al| to be no more than 0.0003, we can achieve this by keeping 
||x — a] no more than 0.0001. Indeed, if we want ||3x — 3a|| to be no more than 
any specified amount (no matter how small), then we can achieve this by making 
sure that ||x — al] is no more than one-third of that amount. 

More generally, if h: R” —> R” is any constant k times the identity function 
(i.e., h(x) = kx) and a € R” is any vector, then 


lim h(x) = lim kx = ka. + 


The main difficulty with Definition 2.1 lies in the terms “arbitrarily small” 
and “sufficiently small.” They are simply too vague. We can add some precision 
to our intuition as follows: Think of applying the function f: X C R” — R” as 
performing some sort of scientific experiment. Letting the variable x take on 
a particular value in X amounts to making certain measurements of the input 
variables to the experiment, and the resulting value f(x) can be considered to be the 
outcome of the experiment. Experiments are designed to test theories, so suppose 
that this hypothetical experiment is designed to test the theory that as the input is 
closer and closer to a, then the outcome gets closer and closer to L. To verify this 
theory, you should establish some acceptable (absolute) experimental error for the 
outcome, say, 0.05. That is, you want ||f(x) — L|| < 0.05, if ||x — all is sufficiently 
small. Then just how small does ||x — a|| need to be? Perhaps it turns out that you 
must have ||x — al| < 0.02, and that if you do take ||x — al| < 0.02, then indeed 
|f(x) — L|| < 0.05. Does this mean that your theory is correct? Not yet. Now, 
suppose that you decide to be more exacting and will only accept an experimental 
error of 0.005 instead of 0.05. In other words, you desire ||f(x) — L|| < 0.005. 
Perhaps you find that if you take ||x — al| < 0.001, then this new goal can be 
achieved. Is your theory correct? Well, there’s nothing sacred about the number 
0.005, so perhaps you should insist that || f(x) — L|| < 0.001, or that ||f(x) — L]| < 
0.00001. The point is that if your theory really is correct, then no matter what 
(absolute) experimental error € you choose for your outcome, you should be 
able to find a “tolerance level” 6 for your input x so that if ||x — al] < ô, then 
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||f(x) — LI] < e. Itis this heuristic approach that motivates the technical definition 
of the limit. 


DEFINITION 2.2 (RIGOROUS DEFINITION OF LIMIT) Let f: X C R” > R” 
be a function. Then to say 

lim f(x) = L 

x>a 
means that given any € > 0, you can find a ô > 0 (which will, in general, 
depend on €) such that if x € X and 0 < ||x — al] < ô, then ||f(x) — LI| < €. 


The condition 0 < ||x —al| simply means that we care only about values 
f(x) when x is near a, but not equal to a. Definition 2.2 is not easy to use in 
practice (and we will not use it frequently). Moreover, it is of little value insofar 
as actually evaluating limits of functions is concerned. (The evaluation of the 
limit of a function of more than one variable is, in general, a difficult task.) 


EXAMPLE 4 So that you have some feeling for working with Definition 2.2, 
let’s see rigorously that 


ar — 5y + 2z) = 12 


m 
(x,y,z) >(1,-1, 


(as should be “obvious”’). This means that given any number e€ > 0, we can find 
a corresponding 6 > 0 such that 


if 0 < |I(x, y,z) —C, -1, 2)|] < ô, then |3x — 5y + 2z — 12| < e. 
(Note the uses of vector lengths and absolute values.) We’ll present a formal 
proof in the next paragraph, but for now we’ll do the necessary background 
calculations in order to provide such a proof. First, we need to rewrite the two 
inequalities in such a way as to make it more plausible that the €-inequality could 
arise algebraically from the 6-inequality. From the definition of vector length, the 
5-inequality becomes 

0 < y(x- 1} +O +1 +e- 2 <6. 
If this is true, then we certainly have the three inequalities 


V(x — 1) = |x -1| <4, 
V(iyt+1P =|y+1| < ô, 
y(z— 2} = |z- 2| < ô. 


Now, rewrite the left side of the €-inequality and use the triangle inequality (2) 


of §1.6: 
[3x — 5y + 2z — 12| = |3(x — 1) — 5(y + 1) + 2(z — 2)| 

< BE- DI + 15@ + DI + 12€ — 2)| 
= 34x — l| + 5ly + 1+ 2ļz — 2l. 

Thus, if 

0 < |I(x, y,z)— (1, —1, 2| < 6, 
then 
Ix-—1] <6, |y+1) <6, and |z-—2| <6, 
so that 


[3x — 5y + 2z — 12| < 3|x — 1| +5ly + 1] + 2|z—2| 
< 35 +55 + 26 = 106. 


x 


Figure 2.31 A closed ball 
centered at a. 
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If we think of ô as a positive quantity that we can make as small as desired, then 
105 can also be made small. In fact, it is 106 that plays the role of e€. 

Now for a formal, “textbook” proof: Given any € > 0, choose ô > 0 so that 
ô < €/10. Then, if 


0 < |\(x, y,z) —C, —1, 2)]] < 6, 
it follows that 
Ix 1] <6, |y+1| <6, and |z—2| <8, 


so that 
[3x — 5y + 2z — 12| < 3|x —1]+5|y + 1| +2|z — 2| 
< 38 + 58 + 26 
€ 
S oi 
Thus, lima, y,.)+(1,-1,2)(3% — Sy + 2z) = 12, as desired. + 


Using the same methods as in Example 4, you can show that 
lim(aıxı + ax + +++ + anXn) = ab; + azb2 +-+ + anbn 
i 


for any a;,i = 1,2,...,n. 


Some Topological Terminology 


Before discussing the geometric meaning of the limit of a function, we need to in- 
troduce some standard terminology regarding sets of points in R”. The underlying 
geometry of point sets of a space is known as the topology of that space. 

Recall from §2.1 that the vector equation ||x — a|| = r, where x and a are 
in R? and r > 0, defines a sphere of radius r centered at a. If we modify this 
equation so that it becomes the inequality 


|x — all <r, (1) 


then the points x € R? that satisfy it fill out what is called a closed ball shown in 
Figure 2.31. Similarly, the strict inequality 

Ix — al] <r (2) 
describes points x € R? that are a distance of less than r from a. Such points 
determine an open ball of radius r centered at a, that is, a solid ball without the 
boundary sphere. 

There is nothing about the inequalities (1) and (2) that tie them to R3. In fact, 
if we take x and a to be points of R”, then (1) and (2) define, respectively, closed 
and open n-dimensional balls of radius r centered at a. While we cannot draw 
sketches when n > 3, we can see what (1) and (2) mean when n is 1 or 2. (See 
Figures 2.32 and 2.33.) 


y y 
a Sm, 

P| `‘ 
á ` 
I r 4 
I 
rt a | 
\ d 

x A + x 
XN p 
bh ad 


Figure 2.32 The closed and open balls (disks) in R? defined by ||x — al] < r and 
lx- all <r. 
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r r 


a a o 


a a 


Figure 2.33 The closed and open balls (intervals) in R 
defined by |x — a| < r and |x — a| < r. 


DEFINITION 2.3 A set X C R” is said to be open in R” if, for each point 
x € X, there is some open ball centered at x that lies entirely within X. A 
point x € R” is said to be in the boundary ofa set X C R” if every open ball 
centered at x, no matter how small, contains some points that are in X and 
also some points that are not in X. A set X C R” is said to be closed in R” 
if it contains all of its boundary points. Finally, a neighborhood of a point 
x € X is an open set containing x and contained in X. 


Itis an easy consequence of Definition 2.3 that a set X is closed in R” precisely 
if its complement R” — X is open. 


EXAMPLE 5 The rectangular region 
X = {(x,y)€ R? | —1 <x<l,—l<y<2} 


is open in R?. (See Figure 2.34.) Each point in X has an open disk around it 
contained entirely in the rectangle. The boundary of X consists of the four sides 


of the rectangle. (See Figure 2.35.) + 
y 


a 
L 


Figure 2.34 The graph of X. 


Figure 2.37 The set X of 
Example 7. 


x 


Figure 2.36 The set X 
of Example 6 consists of 
the nonnegative 
coordinate axes. 


Figure 2.35 Every open disk 
about a point on a side of rectangle 
X of Example 5 contains points in 
both X and R° — X. 


EXAMPLE 6 The set X consisting of the nonnegative coordinate axes in R? in 
Figure 2.36 is closed since the boundary of X is just X itself. + 


EXAMPLE 7 Don’t be fooled into thinking that sets are always either open or 
closed. (That is, a set is not a door.) The set 


X={(x, y) ER |0<x<1,0<y< 1} 


shown in Figure 2.37 is neither open nor closed. It’s not open since, for example, 
the point (4, 0) that lies along the bottom edge of X has no open disk around it 
that lies completely in X. Furthermore, X is not closed, since the boundary of X 
includes points of the form (x, 1) for O < x < 1 (why?), which are not part of X. 

+ 
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The Geometric Interpretation of a Limit 
Suppose that f: X C R” — R”. Then the geometric meaning of the statement 


lim f(x) = L 


is as follows: Given any € > 0, you can find a corresponding ô > 0 such that if 
points x € X are inside an open ball of radius 6 centered at a, then the correspond- 
ing points f(x) will remain inside an open ball of radius € centered at L. (See 
Figure 2.38.) 


y Z 


Figure 2.38 Definition of a limit: Given an open ball B. centered at L (right), you 
can always find a corresponding ball B; centered at a (left), so that points in Bs N X 
are mapped by f to points in Be. 


We remark that for this definition to make sense, the point a must be such 
that every neighborhood of it in R” contains points x € X distinct from a. Such 
a point a is called an accumulation point of X. (Technically, this assumption 
should also be made in Definition 2.2.) A point a € X is called an isolated point 
of X if it is not an accumulation point, that is, if there is some neighborhood of a 
in R” containing no points of X other than a. 

From these considerations, we see that the statement lim,-_., f(x) = L really 
does mean that as x moves toward a, f(x) moves toward L. The significance of 
the “open ball” geometry is that entirely arbitrary motion is allowed. 


EXAMPLE 8 Let f:R? — {(0, 0)} — R be defined by 

x2 — y 

x2 + y2 ` 

Let’s see what happens to f as x = (x, y) approaches 0 = (0, 0). (Note that f is 
undefined at the origin, although this is of no consequence insofar as evaluating 


limits is concerned.) Along the x-axis (i.e., the line y = 0), we calculate the value 
of f to be 


fœ, y= 


0 el 
fa; lea 


Thus, as x approaches 0 along the line y = 0, the values of f remain constant, 
and so 


J&)=1; 


x—0 along y=0 


Along the y-axis, however, the value of f is 


0-y? 
0O,y)= “_=-—], 
fO, y) O+ y2 
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Hence, 


lim f(x)=-1. 


x—0 along x=0 


Indeed, the value of f is constant along each line through the origin. Along the 
line y = mx, m constant, we have 


F ) x -mx xX- m) l1-m 
x,mx) = Z = . 
xL +m? x*(1+m?) l+m? 
Therefore, 
im f= —™ 
im x) = ; 
x—0 along y=mx 1 -+ m? 


As a result, the limit of f as x approaches 0 does not exist, since f has different 
“limiting values” depending on which direction we approach the origin. (See 
Figure 2.39.) That is, no matter how close we come to the origin, we can find 
points x such that f(x) is not near any number L € R. (In other words, every 
open disk centered at (0, 0), no matter how small, is mapped onto the interval 
{—1, 1].) If we graph the surface having equation 


ay? 

Z= 37 

x2 + y? 
(Figure 2.40), we can see quite clearly that there is no limiting value as x ap- 
proaches the origin. + 


f 
x R 
h PAR -1 0 1 
fhas constan fhas constant 
value 1 on value —1 on 
this line A this inte 
Figure 2.39 The function f(x, y) = (x? — y?)/(x? + y?) of ee e 
Example 8 has value | along the x-axis and value —1 along the Se De Sve 


y-axis (except at the origin). 


WARNING Example 8 might lead you to think you can establish that 
lim,_,, f(x) = L by showing that the values of f as x approaches a along straight- 
line paths all tend toward the same value L. Although this is certainly good 
evidence that the limit should be L, it is by no means conclusive. See Exercise 23 
for an example that shows what can happen. 


EXAMPLE 9 Another way we might work with the function f(x, y) = (x? — 
y?)/(x? + y?) of Example 8 is to rewrite it in terms of polar coordinates. Thus, 
let x = r cos 0, y = r sin 0. Using the Pythagorean identity and the double angle 
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formula for cosine, we obtain, for r 4 0, that 


x2—y? 2 cos?6—r? sin? r?(cos?0 — sin? 6) cos 20 


= = = = cos 26. 
x? +y?  r2cos20+r?2sin?0 — r?(cos? 6 + sin? 0) 1 


That is, for r Æ 0, 
f(x, y) = f(r cos0,r sinf) = cos 20. 


Moreover, to evaluate the limit of f as (x, y) approaches (0, 0), we only must 
have r approach 0; there need be no restriction on 6. Therefore, we have 


lim = f(x, y) = limcos20 =cos20. 
(x,y) (0,0) r>0 


This result clearly depends on 6. For example, if = 0 (which defines the x-axis), 
then 


lim cos20 = 1, 
r—0 along 0 = 0 


while if 0 = 2/4 (which defines the line y = x), then 


lim cos 26 = 0. 
r—0 along 0 = 17/4 


Thus, as in Example 8, we see that lim(.,y)~(0,0) f(x, y) fails to exist. + 


EXAMPLE 10 We use polar coordinates to investigate lim&,y)>(0,0) f(x, y), 
where f(x, y) = (x? + x5)/(x? + y3). 
We first rewrite the expression (x? + x°)/(x* + y?) using polar coordinates: 


x? +x’  r°cos? 0 +r’ costo ; Jaci 
5 5 = 37 = r(cos 0 + r° cos? 0). 
x+y r2 cos? 6 + r? sin’ 0 


Now —1 < cos@ < 1, which implies that 
a < cos? 0 + r° cos? 6 < 1+r’. 
Hence, 
—r(l+r)< f@,y)<r(l+r’). 
Asr —> 0, both the expressions —r(1 + r°) and r(1 + r°) approach zero. Hence, 


we conclude that lima ).(0,0) f(x, y) = 0, since f is squeezed between two 
expressions with the same limit. + 


Properties of Limits 


One of the biggest drawbacks to Definition 2.2 is that it is not at all useful for 
determining the value of a limit. You must already have a “candidate limit” in 
mind and must also be prepared to confront some delicate work with inequalities 
to use Definition 2.2. The results that follow (which are proved in the addendum 
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to this section), plus a little faith, can be quite helpful for establishing limits, as 
the subsequent examples demonstrate. 


THEOREM 2.4 (UNIQUENESS OF LIMITS) Ifa limit exists, it is unique. That is, 
let f: X C R” — R”. If limy_,, f(x) = L and lim,_,, f(x) = M, then L = M. 


THEOREM 2.5 (ALGEBRAIC PROPERTIES) Let F, G: X C R” > R” be vector- 
valued functions, f, g: X C R” — R be scalar-valued functions, and let k € R 
be a scalar. 
1. If lim,_,, F(x) = L and lim,_,, G(x) = M, 
then limy_,,(F + G)(x) = L + M. 
2. If limy—., F(x) = L, then limy_,, kF(x) = kL. 
3. If limsa f(x) = L and limy-_,, g(x) = M, then limy-.a(fg)(x) = LM. 
4. If lima f(x) = L, g(x) £0 for x € X, and limy_,, g(x) = M # 0, then 
limy—-a( f/g)(x) = L/M. 


There is nothing surprising about these theorems—they are exactly the same 
as the corresponding results for scalar-valued functions ofa single variable. More- 
over, Theorem 2.5 renders the evaluation of many limits relatively straightforward. 


EXAMPLE 11 Either from rigorous considerations or blind faith, you should 
find it plausible that 


lim x=a and lim y=b. 
(x, y)—>(a,b) (x,y) (a,b) 


From these facts, it follows from Theorem 2.5 parts 1, 2, and 3 that 


lim (x? + 2xy — y?) =a? + 2ab — b’, 
(x,y)>(a,b) 


because, by part 1 of Theorem 2.5, 


lim : (x? + 2xy — y*) = limx? + lim 2xy + lim(—y’) 
x,y)—(a,b) 


and, by parts 2 and 3, 


lim : (x? + 2xy — y*) = (lim x)’ + 2(lim x)(lim y) — (lim y} 
x,y)—>(a,b) 


so that, from the facts just cited, 
lim por + 2xy — y’) =a? + 2ab — h. + 
(x.y)>(a,b) 


EXAMPLE 12 More generally, a polynomial in two variables x and y is any 
expression of the form 


d d 
P(x, y)= >D >. cux*y', 
k=0 1=0 


where d is some nonnegative integer and cą € R for k,l =0,...,d. That is, 
p(x, y) is an expression consisting of a (finite) sum of terms that are real number 
coefficients times powers of x and y. For instance, the expression x? + 2xy — y? 
in Example 11 is a polynomial. For any (a, b) € R?, we have, by part 1 of 
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Theorem 2.5, 
lim xs lim (cyx* 
(x,y) (a,b) PC i= 3 2 (x,y) > (a, A un y 9, 
so that, from part 2, 
lim x, y) = cy lim x*y! 
(x, y)>(a,b) PC y= 2 2 ki (x,y)—> (a,b) y 
and, from part 3, 


lim p(x, y)= 


(x,y)> (a,b) 


d 
X cu(limx*)(lim y’) 
1=0 


d 
k=0 
d d 
= J cyakb'. 
k=0 1=0 
Similarly, a polynomial in n variables x1, x2, ..., Xn is an expression of the form 
d 


ky ko kn 
P(%1,X2, +++, Xn) = 2. Ckik Xi X2 tt An > 


where d is some nonnegative integer and cy,...4, € R for ki, ..., kn =0,...,d. 
For example, a polynomial in four variables might look like this: 


2 8.2 
P(X, + , X4) = 3x{ XQ + X1X2X3X4 — 1x3 X4- 


Theorem 2.5 implies readily that 
k k kn 
lim $ cho ik ena =} cn- paa z aan + 


EXAMPLE 13 We evaluate lim x + xy +3 
(x,y)>(-1,0) x2y — Sxy + y2 + 1 


Using Example 12, we see that 


lim x*+xy+3=4, 
(x,y)>(-1,0) 


and 


lim x*y—5xy+y?+1=1(4 0). 
(x,y) (1,0) 


Thus, from part 4 of Theorem 2.5, we conclude that 
x? +xy +3 _4 


lim = 4. + 
@,y)> 1,0) x 2y 5xy +y? +1 1 


EXAMPLE 14 Ofcourse, not all limits of quotient expressions are as simple 
to evaluate as that of Example 13. For instance, we cannot use Theorem 2.5 to 
evaluate 

lim ss 

(x.y) +(0,0) x? + y4 

since lime, »)-+(0,0)(7 + y*) = 0. Indeed, since lime, »)+(0,0)(7 — y*)=0 as 
well, the expression (x? — y*)/(x? + y*) becomes indeterminate as (x, y) > 
(0, 0). To see what happens to the expression, we note that 

=a" x 


lim ——— = lim — = 1, 
x—>0 along y=0 x2 | yt x>0 x 


(3) 
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while 
2_ 4 _ 4 
im aa lim = —1. 
y+0 along x=0 x2 | y4 y>0 y4 
Thus, the limit in (3) does not exist. (Compare this with Example 8.) + 


The following result shows that evaluating the limit of a function 
f: X C R” — R” is equivalent to evaluating the limits of its (scalar-valued) com- 
ponent functions. First recall from §2.1 that f(x) may be rewritten as (fi(x), 


f2(x), Lig fn). 


THEOREM 2.6 Suppose f: X C R” —> R” is a vector-valued function. Then 
limy-, f(x) = L, where L = (L1, ..., Lm), if and only if limy_,, fix) = L; for 


i=l,...,m. 


EXAMPLE 15 Consider the linear mapping f: R” —> R” defined by f(x) = Ax, 
where A = (aij) is an m x n matrix of real numbers. (See Example 5 of §1.6.) 
Theorem 2.6 shows us that 


lim f(x) = Ab 


for any b = (b;,..., ba) in R”. If we write out the matrix multiplication, we have 


dil +t) Gin x) 

a21 *** An X2 

f(x) = Ax = aot ; 
| Amı *** Amn | I Xn | 


Q11X1 T 412X2 T +++ T ainXn 


a21X1 T 422X2 T ` `` T A2nXn 


Am1 X1 + Am2X2 + ` ` ` + AmnXn 
Therefore, the ith component function of f is 
fix) = di1X1 + ai2X2 + +++ + inXn. 
From Example 4, we have that 
lim fiX) = aiibi + ainba + -++ + dinbn 


for each i. Hence, Theorem 2.6 tells us that the limits of the component functions 
fit together to form a limit vector. We can, therefore, conclude that 


lim f(x) = (lim fi), ..., lim fa) 


(aiibi pos + ainbn, e.’ amıbı po + amnbn) 


II 


aiibi +-+: + ainbn 
azıb +--+ + ambn 

— = Ab, 
amıbı a amnbn 


once we take advantage of matrix notation. + 
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Continuous Functions 


For scalar-valued functions of a single variable, one often adopts the following 
attitude toward the notion of continuity: A function f: X C R > Ris continuous 
if its graph can be drawn without taking the pen off the paper. By this criterion, 
Figure 2.41 describes a continuous function y = f(x), while Figure 2.42 does not. 


Figure 2.41 The graph ofa 
continuous function. 


x 


Figure 2.44 The graph of a 
continuous function f(x, y). 


N 
x x 
Figure 2.42 The graph ofa Figure 2.43 The graph of f 
function that is not continuous. where f(x, y) = 0 if both x > 0 
and y > 0, and where f(x, y) = 1 
otherwise. 


We can try to extend this idea to scalar-valued functions of two variables: A 
function f: X C R? —> R is continuous if its graph (in R*) has no breaks in it. 
Then the function shown in Figure 2.43 fails to be continuous, but Figure 2.44 
depicts a continuous function. Although this graphical approach to continuity is 
pleasantly geometric and intuitive, it does have real and fatal flaws. For one thing, 
we can’t visualize graphs of functions of more than two variables, so how will we 
be able to tell in general ifa function f: X C R” —> R” is continuous? Moreover, 
it is not always so easy to produce a graph of a function of two variables that is 
sufficient to make a visual determination of continuity. This said, we now give a 
rigorous definition of continuity of functions of several variables. 


DEFINITION 2.7 Let f: X C R” —> R” and let a € X. Then f is said to be 
continuous at a if either a is an isolated point of X or if 


lim f(x) = f(a). 


If f is continuous at all points of its domain X, then we simply say that f is 
continuous. 


EXAMPLE 16 Consider the function f: R? — R defined by 


x+ xy 2y? 
—; z  If@,y)# (0,0) 
fœ, y)= xt+y . 


0 if (x, y) = (0, 0) 


Therefore, f (0, 0) = 0, but lim, y)-.(0,0) f(x, y) does not exist. (To see this, check 
what happens as (x, y) approaches (0,0) first along y = 0 and then along x = 0.) 
Hence, f is not continuous at (0,0). + 


It is worth noting that Definition 2.7 is nothing more than the “vectorized” 
version of the usual definition of continuity of a (scalar-valued) function of one 
variable. This definition thus provides another example of the power of our vector 
notation: Continuity looks the same no matter what the context. 
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One way of thinking about continuous functions is that they are the ones whose 
limits are easy to evaluate: When f is continuous, the limit of f as x approaches 
a is just the value of f at a. It’s all too tempting to get into the habit of behaving 
as if all functions are continuous, especially since the functions that will be of 
primary interest to us will be continuous. Try to avoid such an impulse. 


EXAMPLE 17 Polynomial functions inn variables are continuous. Example 12 
gives a sketch of the fact that 


: kı kn — kı k 
lim ) Chink ti e = J Checka A, 
x>a 


where x = (x1, .. . , Xn) anda = (a1, .. . , an) are in R”. If f: R” — R is defined 
by 


k kn 
FO) = Do cheat! xk, 
then the preceding limit statement says precisely that f is continuous ata. 


EXAMPLE 18 Linear mappings are continuous. If f: R” —> R” is defined by 
f(x) = Ax, where A is an m x n matrix, then Example 15 establishes that 


for all b € R”. Thus, f is continuous. + 


The geometric interpretation of the « — ô definition of a limit gives rise to a 
similar interpretation of continuity at a point: f: X C R” — R” is continuous at 
a point a € X if, for every open ball Be in R” of radius € centered at f(a), there 
is a corresponding open ball Bs in R” of radius ô centered at a such that points 
x € X inside B; are mapped by f to points inside Be. (See Figure 2.45.) Roughly 
speaking, continuity of f means that “close” points in X C R” are mapped to 
“close” points in R”. 


x 
Figure 2.45 Given an open ball B, about f(a) (right), you can always find a 


corresponding open ball Bs so that points in Bs N X are mapped to points in Be. 


In practice, we usually establish continuity of a function through the use 
of Theorems 2.5 and 2.6. These theorems, when interpreted in the context of 
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continuity, tell us the following: 


¢ The sum F + G of two functions F, G: X C R” — R” that are continuous 
at a € X is continuous at a. 

° For all k € R, the scalar multiple kF of a function F: X C R” — R” that 
is continuous at a € X is continuous at a. 

¢ The product fg and the quotient f/g (e 4 0) of two scalar-valued functions 
f,g:X C R” — R that are continuous at a € X are continuous at a. 

e F: X C R” — R” is continuous at a € X if and only if its component 
functions F;: X C R” > R,i = 1,...,m are all continuous at a. 


EXAMPLE 19 The function f: R? —> R? defined by 


f(x, y) = (x + y, x’y, y sin (xy) 


is continuous. In view of the remarks above, we can see this by checking that the 
three component functions 


fi@ y=xty, hæ, y)=xy, and f(x, y) = ysin(xy) 


are each continuous (as scalar-valued functions). Now fı and fọ are continuous, 
since they are polynomials in the two variables x and y. (See Example 17.) The 
function f3 is the product of two further functions; that is, 


f(x, y) = gx, y)h(x, y), 


where g(x, y) = y and A(x, y) = sin(xy). The function g is clearly continuous. 
(It’s a polynomial in two variables—one variable doesn’t appear explicitly!) The 
function h is a composite of the sine function (which is continuous as a function 
of one variable) and the continuous function p(x, y) = xy. From these remarks, 
it’s not difficult to see that 


lim h(x,y)= lim — sin(p(x, 
(x,y) (a,b) ( y) (x,y) (a,b) (p( y)) 


= sa ( lim p(x, ») ; 
(x,y)> (a,b) 


since the sine function is continuous. Thus, 


lim A(x, y) = sin p(a, b) = h(a, b), 
(x,y)> (a,b) 
because p is continuous. Thus, A, hence f3, and, consequently, f are all continuous 
on all of R?. + 


The discussion in Example 19 leads us to the following general result, whose 
proof we omit: 


THEOREM 2.8 Iff: X C R” — R” andg:Y C R” — R” are continuous func- 
tions such that range f C Y, then the composite function g o f: X C R” —> R” is 
defined and is also continuous. 
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Addendum: Proofs of Theorems 2.4, 2.5, 2.6, and 2.8 ——— 


For the interested reader, we establish the various results regarding limits of 
functions that we used earlier in this section. 


Proof of Theorem 2.4 The statement lim,_,, f(x) = L means that, given any € > 
0, we can find some 5; > 0 such that ifx € X and0 < ||x — al] < ô, then ||f(x) — 
L|| < €/2. (The reason for writing €/2 rather than € will become clear in a 
moment.) Similarly, limy_., f(x) = M means that, given any € > 0, we can find 
some ô > 0 such that if x € X and 0 < ||x — all < 4), then ||f(x) — M|| < €/2. 

Now let 6 = min(64,, 62); that is, we set ô to be the smaller of ô; and 55. If 
x€ X and 0 < ||x — al] < ô, then both ||f(x) — L|| and || f(x) — M| are less than 
€/2 so that, using the triangle inequality, we have 


IL — MI = II — f(x) + (f(x) — M)|I 


€ € 
< ||L — f(x)|| + fŒ — MI < i aes 
This shows that the quantity ||L — M|| can be made arbitrarily small; thus, it 
follows that L — M = 0. Hence, L = M. E 


Proof of Theorem 2.5 To establish part 1, note that if limx—a F(x) = L, then 
given any € > 0, we can find a 6; > 0 such that if x € X and 0 < ||x —al| < 
6,, then ||F(x) — L|| < €/2. Similarly, if limy_., G(x) = M, then we can find a 
52 > 0 such that if x € X and 0 < ||x — all < 4), then ||G(x) — M|| < €/2. Now 
let 6 = min(6), 62). Then if x € X and 0 < ||x — all < ô, the triangle inequality 
implies that 


(FQ) + Gx) — (Lb + MI < FŒ -LI + |G) -MI < - + ; =e. 


Hence, lim,_,, (F(x) + G(x)) = L + M. 

To prove part 2, suppose that € > 0 is given. If limy_,, F(x) = L, then we can 
find a 6 > 0 such that if x € X and 0 < ||x — al] < 4, then |/F(x) — L|| < €/|k|. 
Therefore, 

€ 
Ik| 


which means that limy_,, KF(x) = kL. (Note: Ifk = 0, then part 2 holds trivially.) 
To establish the rule for the limit of a product of scalar-valued functions (part 
3), we will use the following algebraic identity: 


f&a) — LM = (f(x) — L)(g(x) — M) + L(g(x) — M) + M(f (x) — L). 
(4) 
If limy.a f(x) = L, then, given any € > 0, we can find 5; > 0 such that if x € X 
and 0 < ||x —al| < 6), then 


|KF(x) — KL|| = Ik] ||F@x) — Ll < k| — = €, 


|f(x) -— L| < ve. 


Similarly, if limy., g(x) = M, we can find 62 > 0 such that if x € X and 0 < 
|x — al] < 69, then 


|g(x) — M| < Ve. 
Let ô = min(6,, 62). Ifx € X and 0 < ||x — all < ô, then 
(f(x) — L)(g(x) — M)| < Ve - Ve =e. 
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This means that limy-,a( f(x) — L)(g(x) — M) = 0. Therefore, using (4) and parts 
1 and 2, we see that 


lim(f()g(x) — LM) = lim( f(x) — L)(g(x) — M) + L lim(g(x) — M) 
+M lim( f(x) — L) 
=0+0+0=0. 


Since limy-,a f(x)g(x) = limy_,a((f(x)g(x) — LM) + LM), the desired result 
follows from part 1. 
The crux of the proof of part 4 is to show that 


‘ 1 1 
lim —— = —. 
x>a g(x) M 
Once we show this, the desired result follows directly from part 3: 
1 1 L 
tim 0 = tim (fe) 2) = 2 go = = 
xa g(x) a g(x) M M 
Note that 


| 1 1|_ IM-g@) 
g(x) M |Mg(x)| 
and, by the triangle inequality, that 
|M| = |M — g(x) + g(®)| < IM — g(x) + lg@)I- (5) 
If limsa g(x) = M, then, given any € > 0, we can find ô; such that ifx € X and 
0 < ||x —al] < 4), then 
M? 
|g(x) — M| < Da 
We can also find 52 such that if x € X and 0 < ||x — al] < ô2, then |g(x) — M| < 
|M|/2 and, hence, using (5), that 
miw e ț w Ss << o 
< — + |g(x g(x)| > — < —. 
2 2 Is@| IMI 


Now let 6 = min(ô1, 52). If x € X and 0 < ||x — al] < ô, then 


| 1 1}_ IM—s@_ 1 |M -8%l 


g(x) M |Mg(x)| IM] — |g(x)I 
1 2 M 
< — — — e€ =E. 
|M| |M| 2 a 
Proof of Theorem 2.6 Note first that, fori = 1,...,m, 


lf) Lil < VA) LP +++ + Gfin() Ln? = If) Ll. (6) 


If limy_., f(x) = L, then given any € > 0, we can find a ê > 0 such that if x € 
X and 0 < ||x — all < ô, then ||f(x) — L|| < €. Hence, (6) implies that | f;(x) — 
L;| < € fori = 1,...,m, which means that lim, ,, AO) = Li. 

Conversely, suppose that limy-,, f(x) = L; fori = 1,...,m. This means 
that, given any € > 0, we can find, for each i, a 6; > 0 such that if x € X and 
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0 < ||x — all < 4;, then | f(x) — Lil < €/./m. Set ô = min(6), .. 
x € X and 0 < ||x — all < ô, we see that (6) implies 


e? e? e? 
moenie se Say —=€, 
m m m 
Thus, limy_,, f(x) = L. ia 


., Ôm). Then if 


Proof of Theorem 2.8 We must show that the composite function g o f is con- 
tinuous at every point a € X. If a is an isolated point of X, there is nothing to 
show. Otherwise, we must show that lim,_,4(g 0 f)(x) = (g 0 f)(a). 

Given any € > 0, continuity of g at f(a) implies that we can find some y > 0 
such that if y € range f and 0 < ||y — f(a)|| < y then 


ligy) — g(fla))ll < e. 


Since f is continuous at a, we can find some ô > 0 such that if x € X and 0 < 
|x — all < 6, then 


I|f(x) — f(a)ll < y. 
Therefore, if x € X and 0 < ||x — al] < ô, then 


I|g(f(x)) — g(f(a)) || < e€. E 
2.2 Exercises 
In Exercises 1—6, determine whether the given set is open or x? +2xy + y? 
closed (or neither). 13. a) to, 0) x+y 
1. (Œ, y) ER? |1 <x? +y? <4} 14. lim 
x,y)> 2 2 
2. {(x,y)E R2| 1 <x? +y? <4} ee 
2 24 42 x-y 
3. syERI[1< 4 15. im ———~ 
{(x, y) saty <4} et One: 
4. (x,y z) E R| 1 <x? +y? +z? <4} 2 
16. li ——— 
5. {(x,y) ER? | —1 <x < U(x, y) E€ R? |x = 2} (9) 20,0) x2 F y? 
6. {(x, y,z) ER | 1 <x? +y? <4} . x? — xy 
17. lim —— 
Evaluate the limits in Exercises 7—21, or explain why the limit (x.y) O.0.xAy /X — Y 
fails to exist. i i x? — y? —4x+4 
7. im, ‘ oF +2xy ere +2 | EDO x2 + y? — 4x +4 
(x,y,z) (0,0, 
19. lim e“ cosy? — x 
8. lim ly| (x.y. O.V7,1) 
(x,y) (0,0) ,/ x2 + y2 20 lim 2x? + 3y? + 2 
(x + yy "(yc (0.0.0) x? + y? + 2? 
9. im 
(x.y)>(0,0) x2 + y? 21 w XY — XZ + YZ 
ae (x,y,z)>(0,0,0) x2 + y2 + z? 
10. lim PERRE sin @ 
(0.0 x + y + 22. (a) What is lim ——? 
2x? + y? ieee : ( ) 
1141. lim ——— . ; sin(x + y 
iy 24 y2 b) Whatis lim §———? 
PED) AE (P) (y)+(0,0) x+y 
2 2 + 2 H 
12. aa ij Wiatis, m #0 


im = 
ey) x? + y? 


(x,y)>(0,0) xy 


23. Examine the behavior of f(x, y) = x*y*/(x? + yt» 
as (x, y) approaches (0, 0) along various straight lines. 
From your observations, what might you conjecture 
lim, y)+(0,0) f(x, y) to be? Next, consider what hap- 
pens when (x, y) approaches (0,0) along the curve 
x = y*. Does lim, y)+(,0) f(x, y) exist? Why or why 
not? 
In Exercises 24-27, (a) use a computer to graph z = f(x, y); 
(b) use your graph in part (a) to give a geometric discussion 
as to whether lima y)+(0,0) f (x, y) exists; (c) give an analytic 
(i.e., nongraphical) argument for your answer in part (b). 


4x? + 2xy + 5y? 


24. f(x,y) = 
© fœ y) 3x? + Sy? 
@ 25 f(x ss 
: 2i -2y 
5 
xy 
Q 26. FO = ayy 
l a 
xsin-— ify#0 
Q 27. fœ) = y 
0 ify =0 


Some limits become easier to identify if we switch to a different 
coordinate system. In Exercises 28—33 switch from Cartesian to 
polar coordinates to evaluate the given limits. In Exercises 34— 
37 switch to spherical coordinates. 


xy 

28. lim ——— 

(x.y)>(0,0) x? + y? 
2 


@,y)=>0,0) x? + y? 


P4xyty 


30. lim - 
(x.y)(0,0) x? + y? 


x5 + yt — 3x3 y + 2x? + 2y? 


31. = lim ag 


(x,y) (0,0) 


2 2 
P xX — 

32. lim a 
(x,y) (0,0) J2 +y 


; x+y 
im = —— 
(x,y) (0,0) / x2 + y? 


lim wy 
y.) > (0,0,0) x2 F y? +22 


33. 


34. 


35. imo Ss o 
(x,y,z) >(0,0,0) x2 + y2 + z? 
2 2 
. x? + 
36. lim ee 
(x.y) (0.0.0) /x2 4 y2 4 2? 
XZ 
37. 


lim os 
(x.y.2)+(0,0,0) x2 + y? + z? 
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In Exercises 38—45, determine whether the functions are con- 
tinuous throughout their domains: 
38. f(x, y) =x? +2xy— y 
39. f(x,y,z) =x + ~ +yz? +2 


40. g(x, y) = oi 


41. h(x, y) = cos a(i 


e) 


42. f(x, y) = cos? x — 2sin? xy 
x? =y 

43. fœ, y= {2+ 
0 if (x, y) = (0, 0) 


if (x, y) # (0, 0) 


HLHH? 
44. g(x,y) = x? +y? 
2 if(x,y)= 


if (x, y) # (0, 0) 


(0, 0) 


my 
45. F(x, y, z) = | x? +3xy, als , sin 2 
2x? + yt +3 y? +l 


46. Determine the value of the constant c so that 


x? + xy? + 2x? + 2y? 
x2 + y2 
c if(x,y)= 


if (x, y) # (0, 0) 
g(x,y) = 


(0, 0) 
is continuous. 


47. Show that the function f:R*® — R given by f(x) = 
(2i — 3j + k) «x is continuous. 


48. Show that the function f: R? > R? given by f(x) = 
(6i — 5k) x x is continuous. 
Exercises 49-53 involve Definition 2.2 of the limit. 
49. Consider the function f(x) = 2x — 3. 
(a) Show that if |x — 5| < ô, then | f(x) — 7| < 26. 
(b) Use part (a) to prove that lim,_,5 f(x) = 7. 
50. Consider the function f(x, y) = 2x — 10y +3. 


(a) Show that if ||(x, y) — (5, 1)|| < ô, then |x — 5| < 
ô and |y — 1| < ô. 


(b) Use part (a) to show that if ||(x, y)— (5, DI| < 6, 
then | f(x, y) — 3| < 126. 
(c) Show that lim¢.,y) (5,1) FŒ, y) = 3. 
51. IfA, B,andC are constants and f(x, y) = Ax + By + 
C, show that 
lim f(x, y) = fo, yo) = Axo + Byo + C. 


(x,y) (x0, Yo) 
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52. In this problem, you will establish rigorously that 
ety 
im ——~=0. 
(x.y) (0,0) x2 + y? 

(a) Show that |x| < |Œ, yl] and |y] < IŒ, y). 

(b) Show that |x? + y3| < 2(x? + y*)3/?. (Hint: Begin 
with the triangle inequality, and then use part (a).) 

(c) Show that if 0<||(x, y)|| <6, then |(x? + y3)/ 
(x? + y?)| < 28. 

(d) Now prove that limg y=0,0 + y3)/(? + 

A J 

y)=0. 


53. (a) Ifa and b are any real numbers, show that 2|ab| < 
a+b’. 
(b) Let 
vay? 
fœ, y)=xy (=>) 
Use part (a) to show that if 0 < || (x, y)|| < ô, then 
If(, y)| < 67/2. 


(c) Prove that lim(,,y) (0,0) f(x, y) exists, and find its 
value. 


2.3 The Derivative 


Our goal for this section is to define the derivative of a function f: X C R” —> R”, 
where n and m are arbitrary positive integers. Predictably, the derivative of a 
vector-valued function of several variables is a more complicated object than the 
derivative of a scalar-valued function of a single variable. In addition, the notion of 
differentiability is quite subtle in the case of a function of more than one variable. 

We first define the basic computational tool of partial derivatives. After do- 
ing so, we can begin to understand differentiability via the geometry of tangent 
planes to surfaces. Finally, we generalize these relatively concrete ideas to higher 


dimensions. 


Partial Derivatives 


Recall that if F: X C R —> Ris ascalar-valued function of one variable, then the 
derivative of F ata number a € X is 


F'(a) = lim 


F(a +h) — F(a) 


7 (1) 


Moreover, F is said to be differentiable at a precisely when the limit in equation 


(1) exists. 


DEFINITION 3.1 


Suppose f: X C R” — Risascalar-valued function ofn 
variables. Let x = (x1, X2,..., Xn) denote a point of R”. A partial function 
F with respect to the variable x; is a one-variable function obtained from 
f by holding all variables constant except x;. That is, we set x; equal to a 
constant a; for j Æ i. Then the partial function in x; is defined by 


IPG) = Ff Gils O espa 009 lak 


EXAMPLE 1 If f(x, y) = (x? — y?)/(x? + y”), then the partial functions with 


respect to x are given by 


F(x) = f( ae 
x)= f(x,@)= , 
4 x? +a? 


where az may be any constant. If, for example, az = 0, then the partial function is 


2 


X 
F(x) = f@,0=>=1. 


Domain of f 


Domain of F 
(restriction 


of f) 


Figure 2.46 The function f 
Example 1 is defined on R? — 


{(0,0)}, while its partial function F 


along y = 0 is defined on the 
X-axis minus the origin. 


zZ 


x 


Figure 2.47 Visualizing the 
partial derivative a, b). 


x 


Figure 2.48 Visualizing the 
partial derivative (a, b). 


of 
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Geometrically, this partial function is nothing more than the restriction of f to 
the horizontal line y = 0. Note that since the origin is not in the domain of f, 0 
should not be taken to be in the domain of F. (See Figure 2.46.) + 


REMARK In practice, we usually do not go to the notational trouble of explic- 
itly replacing the x;’s (j # i) by constants when working with partial functions. 
Instead, we make a mental note that the partial function is obtained by allowing 
only one variable to vary, while all the other variables are held fixed. 


DEFINITION 3.2 The partial derivative of f with respect to x; is the 
(ordinary) derivative of the partial function with respect to x;. That is, the 
partial derivative with respect to x; is F’(x;), in the notation of Definition 
3.1. Standard notations for the partial derivative of f with respect to x; are 
af 
Ox; j 
Symbolically, we have 
of ne HESipesane harm acenlcn — if Cklpooandon) 
— = lim : 
Ox; hoo h 


D f{Chovecgdha and jf (Chg 00 og d85)): 


(2) 


By definition, the partial derivative is the (instantaneous) rate of change of f 
when all variables, except the specified one, are held fixed. In the case where f 
is a (scalar-valued) function of two variables, we can understand 
af 
(a,b 
ax (a:b) 

geometrically as the slope at the point (a, b, f(a, b)) of the curve obtained by 


intersecting the surface z = f(x, y) withthe plane y = b, as shown in Figure 2.47. 
Similarly, 


af 
ay b) 


is the slope at (a, b, f(a, b)) of the curve formed by the intersection of z = f(x, y) 
and x = a, shown in Figure 2.48. 


EXAMPLE 2 For the most part, partial derivatives are quite easy to compute, 
once you become adept at treating variables like constants. If 


f(x, y) = x’y + cos(x + y), 


then we have 


af : 
— = 2xy — sin(x + y). 
Ox 


(Imagine y to be a constant throughout the differentiation process.) Also 


of = x? — sin(x + y). 
dy 
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y 


(a, F(a)) 


Figure 2.49 The tangent line to 
y = F(x) at x = a has equation 
y = F(a) + F'(a)(x — a). 


(Imagine x to be a constant.) Similarly, if g(x, y) = xy/(x? + y’), then, from the 
quotient rule of ordinary calculus, we have 
@?+y)y—xyQ@x) _ yy? x) 

(x2 ae yy? (x? m y2} ? 


8x(X, y) = 


and 
(x? + y?)x —xy(2y) _ x(x? — y’) 
(x? + y? ery 
Note that, of course, neither g nor its partial derivatives are defined at (0,0). 


8y(xX, y= 


EXAMPLE 3 Occasionally, it is necessary to appeal explicitly to limits to eval- 
uate partial derivatives. Suppose f: R? — R is defined by 


3x2y — y? 
f(y = ae if (x, y) # (0, 0) 


0 if (x, y) = (0, 0) 
Then, for (x, y) Æ (0, 0), we have 


af 8xy> j af 3x4 — 6x?y? — yt 
= an = 
dx +y? dy G+? 
af af E 
But what should an 0) and ay 0) be? To find out, we return to Definition 
x Y 


3.2 of the partial derivatives: 
of _ fO+h,0)— f0,0) ~ 0-0 


— = 1 — = 
ax) pen h N h i 


and 
of fO,O+h)- FO) _ —h—0 


ay o) = ht h h->0 h h->0 i + 


Tangency and Differentiability 


If F: X C R > R is a scalar-valued function of one variable, then to have F 
differentiable at a number a € X means precisely that the graph of the curve 
y = F(x) has a tangent line at the point (a, F(a)). (See Figure 2.49.) Moreover, 
this tangent line is given by the equation 


y = F(a) + F'(a)& — a). (3) 
If we define the function H(x) to be F(a) + F’(a)(x — a) (i.e., H(x) is the right 


side of equation (3) that gives the equation for the tangent line), then H has two 
properties: 


1. H(a) = F(a) 
2. H'(a) = F'(a). 


In other words, the line defined by y = H(x) passes through the point (a, F (a)) 
and has the same slope at (a, F'(a)) as the curve defined by y = F(x). (Hence, 
the term “tangent line.”) 

Now suppose f: X C R? — R isa scalar-valued function of two variables, 
where X is open in R?. Then the graph of f is a surface. What should the tangent 
plane to the graph of z = f(x, y) at the point (a, b, f(a, b)) be? Geometrically, 


(a, b, f(a, b)) 


x), 


Figure 2.50 The plane tangent 
toz = f(x, y) at 
(a,b, f(a, b)). 


Figure 2.51 The tangent plane at 
(a, b, f(a, b)) contains the lines 
tangent to the curves formed by 
intersecting the surface 

z= f(x, y) by the planes x = a 
and y = b. 


2.3 | The Derivative 119 


the situation is as depicted in Figure 2.50. From our earlier observations, we 
know that the partial derivative fx(a, b) is the slope of the line tangent at the 
point (a, b, f(a, b)) to the curve obtained by intersecting the surface z = f(x, y) 
with the plane y = b. (See Figure 2.51.) This means that if we travel along this 
tangent line, then for every unit change in the positive x-direction, there’s a change 
of f(a, b) units in the z-direction. Hence, by using formula (1) of §1.2, the tangent 
line is given in vector parametric form as 


h(t) = (a, b, f(a, b)) + tC, 0, fx(a, b)). 
Thus, a vector parallel to this tangent line is 
u=i+ fi(a, b)k. 


Similarly, the partial derivative f(a, b) is the slope of the line tangent at the point 
(a, b, f(a, b)) to the curve obtained by intersecting the surface z = f(x, y) with 
the plane x = a. (Again see Figure 2.51.) Consequently, the tangent line is given 
by 

b(t) = (a, b, f(a, b)) + 1, 1, fy, b)), 


so a vector parallel to this tangent line is 
v=j+ fy(a,b)k. 


Both of the aforementioned tangent lines must be contained in the plane tangent 
to z = f(x, y) at (a,b, f(a, b)), if one exists. Hence, a vector n normal to the 
tangent plane must be perpendicular to both u and v. Therefore, we may take n 
to be 


n=uxv= -—f(a,b)i— fy(a,b)j+ k. 


Now, use equation (1) of §1.5 to find that the equation for the tangent plane—that 
is, the plane through (a, b, f(a, b)) with normal n—is 


(— f(a, b), —fy(a, b), 1)- (x = 4, yY > b, c= f(a, b)) =0 


or, equivalently, 


— fx(a, b)(x — a) — fy(a, by — b)+ z — f(a, b) = 0. 


By rewriting this last equation, we have shown the following result: 


THEOREM 3.3 If the graph of z= f(x,y) has a tangent plane at 
(a, b, f(a, b)), then that tangent plane has equation 


z= f(a, b)+ frla, bx — a) + fy(a, b) — b). (4) 


Note that if we define the function h(x, y) to be equal to f(a, b) + fi(a, b\(x — 
a)+ fy(a, b\(y — b) (ie., h(x, y) is the right side of equation (4)), then A has the 
following properties: 


1. h(a, b) = f(a, b) 
ah _ af ah _ af 
2. ODRO and 5 (a,b) = 5 (a,b). 


In other words, h and its partial derivatives agree with those of f at (a, b). 
It is tempting to think that the surface z = f(x, y) has a tangent plane at 
(a, b, f(a, b)) as long as you can make sense of equation (4), that is, as long as the 
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Z 


A 
J 


Figure 2.52 If two points 
approach (0, 0, 0) while remaining 
on one face of the surface 
described in Example 4, the 
limiting plane they and (0, 0, 0) 
determine is different from the one 
determined by letting the two 
points approach (0, 0, 0) while 
remaining on another face. 


partial derivatives fx(a, b) and f(a, b) exist. Indeed, this would be analogous to 
the one-variable situation where the existence of the derivative and the existence 
of the tangent line mean exactly the same thing. However, it is possible for a 
function of two variables to have well-defined partial derivatives (so that equation 
(4) makes sense) yet not have a tangent plane. 


EXAMPLE 4 Let f(x, y) = |lx|— lyl| — |x| — |y| and consider the surface 
defined by the graph ofz = f(x, y) shown in Figure 2.52. The partial derivatives 
of f at the origin may be calculated from Definition 3.2 as 
FO+h, 0) — FO, 0 im IZ IA 
= hn ——— = 
h h-0 h->0 


fx(0, 0) = tim 


and 

KOO = fi PIRES = i, = m0 = 0 
(Indeed, the partial functions F(x) = f(x, 0) and G(y) = f(0, y) are both identi- 
cally zero and, thus, have zero derivatives.) Consequently, ifthe surface in question 
has a tangent plane at the origin, then equation (4) tells us that it has equation 
z = 0. But there is no geometric sense in which the surface z = f(x, y) has a 
tangent plane at the origin. If we think of a tangent plane as the geometric limit of 
planes that pass through the point of tangency and two other “moving” points on 
the surface as those two points approach the point of tangency, then Figure 2.52 
shows that there is no uniquely determined limiting plane. + 


Example 4 shows that the existence of a tangent plane to the graph of 
z = f(x, y) is a stronger condition than the existence of partial derivatives. It 
turns out that such a stronger condition is more useful in that theorems from the 
calculus of functions of a single variable carry over to the context of functions 
of several variables. What we must do now is find a suitable analytic definition of 
differentiability that captures this idea. We begin by looking at the definition of 
the one-variable derivative with fresh eyes. 

By replacing the quantity a + h by the variable x, the limit equation in formula 
(1) may be rewritten as 


F'(a) = lim F(x) — F@ 
xa x —_a 


This is equivalent to the equation 


lim (“2 = nt) F'(a)=0. 


x>a x—a 
The quantity F'(a) does not depend on x and therefore may be brought inside the 
limit. We thus obtain the equation 
F(x)— F 

lim pa f — Fa} = 0. 

xa x—a 
Finally, some easy algebra enables us to conclude that the function F is differen- 
tiable at a if there is a number F'(a) such that 


i F(x) -[F@+F'@e—a] _ 
im = 


xa x-a 


0. (5) 


What have we learned from writing equation (5)? Note that the expression in 
brackets in the numerator of the limit expression in equation (5) is the function 


(x, F(x)) 
(a, F(a)) 


Figure 2.53 If F is differentiable 
at a, the vertical distance between 
F(x) and H(x) must approach 
zero faster than the horizontal 
distance between x and a does. 
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H (x) that was used to define the tangent line to y = F(x) at (a, F(a)). Thus, we 
may rewrite equation (5) as 
_ Fx) — H(x) 

lim ———— = 


xa x—a 


0. 


For the limit above to be zero, we certainly must have that the limit of the numerator 
is zero. But since the limit of the denominator is also zero, we can say even more, 
namely, that the difference between the y-values of the graph of F and ofits tangent 
line must approach zero faster than x approaches a. This is what is meant when 
we say that “H is a good linear approximation to F near a.” (See Figure 2.53.) 
Geometrically, it means that, near the point of tangency, the graph of y = F(x) 
is approximately straight like the graph of y = H (x). 

If we now pass to the case of a scalar-valued function f(x, y) of two variables, 
then to say that z = f(x, y) has a tangent plane at (a, b, f(a, b)) i.e., that f is 
differentiable at (a, b)) should mean that the vertical distance between the graph 
of f and the “candidate” tangent plane given by 


z = h(x, y)= f(a, b) + frla, bx — a) + fy(a, b) — b) 


must approach zero faster than the point (x, y) approaches (a, b). (See Fig- 
ure 2.54.) In other words, near the point of tangency, the graph of z = f(x, y) 1s 
approximately flat just like the graph ofz = h(x, y). We can capture this geometric 
idea with the following formal definition of differentiability: 


DEFINITION 3.4 Let X be open in R? and f: X C R? > R be a scalar- 
valued function of two variables. We say that f is differentiable at (a, b) € X 
if the partial derivatives f,(a, b) and f(a, b) exist and if the function 


h(x, y)= f(a, b) + fx(a, b)x — a) + fy, b) — b) 
is a good linear approximation to f near (a, b)—that is, if 
F(@sy) — AG, y) _ 
(yb) |[(x, y) — (a, b)|| 
Moreover, if f is differentiable at (a, b), then the equation z = h(x, y) de- 
fines the tangent plane to the graph of f at the point (a, b, f(a, b)). If f 


is differentiable at all points of its domain, then we simply say that f is 
differentiable. 


(x, y, f(x, y)) 


(x,y, h(x, y)) 


(a,b, f(a, b)) 
x 
Figure 2.54 If f is differentiable at (a, b), the distance 


between f(x, y) and h(x, y) must approach zero faster than 
the distance between (x, y) and (a, b) does. 
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EXAMPLE 5 Let us return to the function f(x, y) = ||x| — |y|| — |x| — |y| of 
Example 4. We already know that the partial derivatives f,(0,0) and f,(0, 0) 
exist and equal zero. Thus, the function h of Definition 3.4 is the zero function. 
Consequently, f will be differentiable at (0,0) just in case 


fæ- m FY) 
&»>0.0) |x, y)— (0, 9)|| > .9) |Œ, y) 


Ixl — IIl — lx] — Iyl 


lim 
(x,y)—(0,0) / x2 + y2 


is zero. However, it is not hard to see that the limit in question fails to exist. Along 
the line y = 0, we have 


fœ y) _ Wel= Ol = lel 101 | 0 
IIx, y) Vx? jx] 


but along the line y = x, we have 


fœ») _ ll- kll -ll lel z2x J 
Ix, y) VXL + x2 J2\x| i 


Hence, f fails to be differentiable at (0, 0) and has no tangent plane at (0, 0, 0).¢ 


The limit condition in Definition 3.4 can be difficult to apply in practice. 
Fortunately, the following result, which we will not prove, simplifies matters in 
many instances. Recall from Definition 2.3 that the phrase “a neighborhood of 
a point P ina set X” just means an open set containing P and contained in X. 


THEOREM 3.5 Suppose X is open in R°. If f: X — R has continuous partial 
derivatives in a neighborhood of (a, b) in X, then f is differentiable at (a, b). 


A proof of a more general result (Theorem 3.10) is provided in the addendum 
to this section. 


EXAMPLE 6 Let f(x,y) =x? +2y?. Then df/dx = 2x and df/dy = 4y, 
both of which are continuous functions on all of R?. Thus, Theorem 3.5 implies 
that f is differentiable everywhere. The surface z = x? + 2y must therefore 
have a tangent plane at every point. At the point (2, —1), for example, this tangent 
plane is given by the equation 


z7=6+4(« —2)-4(y +1) 
(or, equivalently, by 4x — 4y — z = 6). + 


While we’re on the subject of continuity and differentiability, the next result is 
the multivariable analogue of a familiar theorem about functions of one variable. 


THEOREM 3.6 If f:X C R? > Ris differentiable at (a, b), then it is continu- 
ous at (a, b). 
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EXAMPLE 7 Let the function f: R? — R be defined by 


fa y) =] x4 +y if (x, y) # (0, 0) 
0 — if(x, y) = (0,0) 


The function f is not continuous at the origin, since lim, y).0,0) f(x, y) does 
not exist. (However, f is continuous everywhere else in R^.) By Theorem 3.6, f 
therefore cannot be differentiable at the origin. Nonetheless, the partial derivatives 
of f do exist at the origin, and we have 


0 of 
= ——_ = —(0,0) = 0, 
fe. 0= a5 =0 => 50.0) 
and 
0 of 
0, y) = —— = 0 —(0,0) = 0, 
fON= Fp = => 7,00 


since the partial functions are constant. Thus, we see that if we want something 
like Theorem 3.6 to be true, the existence of partial derivatives alone is not 
enough. + 


Differentiability in General 


It is not difficult now to see how to generalize Definition 3.4 to three (or more) 
variables: For a scalar-valued function of three variables to be differentiable at a 
point (a, b, c), we must have that (i) the three partial derivatives exist at (a, b, c) 
and (ii) the function A: R? —> R defined by 


h(x, y, z) = f(a, b, c) TT f(a, b, c)(x a a) 
ag fyla, b, cyy = b) + fla, b, cz B c) 
is a good linear approximation to f near (a, b, c). In other words, (ii) means that 


lim fœ, yz) = AC yz) _ 
(x,y,z) > (a,b,c) Il(x, ys z) _ (a, b, c)Il 


The passage from three variables to arbitrarily many is now straightforward. 


DEFINITION 3.7 Let X be open in R” and f: X — R be a scalar-valued 
function; let a = (a1, a2, ..., an) E€ X. We say that f is differentiable at 
a if all the partial derivatives f,,(a), i = 1,...,n, exist and if the function 
h: R” — R defined by 


h(x) = f(a)+ ENNE — a1) + ENa — a2) 


test Sanaan — an) (6) 
is a good linear approximation to f near a, meaning that 
—h 
Se SE 


xa ||x — all 
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We can use vector and matrix notation to rewrite things a bit. Define the 
gradient of a scalar-valued function f: X C R” — R to be the vector 


af af =) 


Ox) 0x2’? OXp 


vfa) = ( 
Consequently, 


V f(a) = (fr (a), fala), «+» fr, A). 


Alternatively, we can use matrix notation and define the derivative of f at a, 
denoted Df (a), to be the row matrix whose entries are the components of V f(a); 
that is, 


DFD = [A f(a) --- fr,(a)]. 


Then, by identifying the vector x — a with then x 1 column matrix whose entries 
are the components of x — a, we have 


Xi —a 

na 
V f(a)-(x—a) = DfAA- a) = [A f(a) --- fr,(a)] 

Xn — An 


= faa) — a1) + faa) — a2) 
resia A (a)(xp i an). 


Hence, vector notation allows us to rewrite equation (6) quite compactly as 


h(x) = f(a) + V f(a): -— a). 


Thus, to say that is a good linear approximation to f near a in equation (6) 
means that 
fim O- FAVA- _ 


ae Ix — all 


0. (7) 


Compare equation (7) with equation (5). Differentiability of functions of one and 
several variables should really look very much the same to you. It is worth noting 
that the analogues of Theorems 3.5 and 3.6 hold in the case of n variables. 

The gradient of a function is an extremely important construction, and we 
consider it in greater detail in §2.6. 

You may be wondering what, if any, geometry is embedded in this general 
notion of differentiability. Recall that the graph of the function f: X C R” > R 
is the hypersurface in R"*! given by the equation x,4; = f(x1,%2,..., Xn). 
(See equation (2) of §2.1.) If f is differentiable at a, then the hypersurface deter- 
mined by the graph has a tangent hyperplane at (a, f (a)) given by the equation 


Xn+1 = h(x, X2, ..., Xn) = f(a) + Vf (a) (x — a) 
= f(a) + Df(a)(x — a). (8) 


Compare equation (8) with equation (3) for the tangent line to the curve y = F(x) 
at (a, F(a)). Although we cannot visualize the graph of a function of more than 
two variables, nonetheless, we can use vector notation to lend real meaning to 
tangency in n dimensions. 


EXAMPLE 8 Before we drown in a sea of abstraction and generalization, let’s 
do some concrete computation. An example of an “n-dimensional paraboloid” in 
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R"*! is given by the equation 


2 2 2 
Xn41 = XL XZ +e +X, 


that is, by the graph of the function f(x,,...,X,)) = x? + Xe ++ x. We have 


so that 
VF (ise csa Xn) = 6 20d ose 2a) 

Note that the partial derivatives of f are continuous everywhere. Hence, the 
n-dimensional version of Theorem 3.5 tells us that f is differentiable everywhere. 
In particular, f is differentiable at the point (1, 2,..., 7), 

Vf, 2,...,2) =(2,4,..., 2), 
and 
DfA,2,...,n)=[2 4 ae 2n]. 
Thus, the paraboloid has a tangent hyperplane at the point 
ht ApEn) 


whose equation is given by equation (8): 


xı— l 
x2 — 2 
Xn = (1+2 4---+n°)+[2 4 oo 2n] 
Xn — n 
= (1? +2 +- +n M 1) a — 2 Hana n) 
= (1° I por n?) 2x1 + 4x2 +- + 2NXn 
(2-1+4:-2+.--+2n-n) 
= 2x1 + 4x2 +- + 2nxn — (1? +22 +--+ n3) 


L n(n + 1)(2n + 1) 
= > 21x; 5 : 
i=l 
(The formula 1? + 2? + --- +n? = n(n + 1)(2n + 1)/6 is a well-known identity, 
encountered when you first learned about the definite integral. It’s straightforward 
to prove using mathematical induction.) + 


At last we’re ready to take a look at differentiability in the most general setting 
of all. Let X be open in R” and let f: X — R” be a vector-valued function of n 
variables. We define the matrix of partial derivatives of f, denoted Df, to be 


126 


Chapter 2 | Differentiation in Several Variables 


the m x n matrix whose ijth entry is 0f;/0x;, where f;: X C R” — Ris the ith 
component function of f. That is, 


a ft OH 
Ox} OX2 əxa 
Df(x1, X2, ..., Xn) = ðX 3x2 OXy 
Ofm Bfm fn 
Ox] OXx2 əxa 


The ith row of Df is nothing more than D f;—and the entries of Df; are precisely 
the components of the gradient vector V f;. (Indeed, in the case where m = 1, 
V f and Df mean exactly the same thing.) 


EXAMPLE 9 Suppose f: R? —> R? is given by f(x, y, z) = (x cos y + z, xy). 
Then we have 


cosy —x sin 1 
Dats, ».2)= | - Hie aA ‘ 


We generalize equation (7) and Definition 3.7 in an obvious way to make the 
following definition: 


DEFINITION 3.8 (GRAND DEFINITION OF DIFFERENTIABILITY) Let X C 
R” be open, let f: X — R”, and let a € X. We say that f is differentiable at 
aif Df(a) exists and if the function h: R” — R” defined by 


h(x) = f(a) + Df(a)(x — a) 
is a good linear approximation to f near a. That is, we must have 
(f(x) — hæl _ . I f(x) — fa) + Df(ay(x — a] 
lim = lim = 


x>a ||x—all xa Ix — all 


0. 


Some remarks are in order. First, the reason for having the vector length 
appearing in the numerator in the limit equation in Definition 3.8 is so that there 
is a quotient of real numbers of which we can take a limit. (Definition 3.7 concerns 
scalar-valued functions only, so there is automatically a quotient of real numbers.) 
Second, the term Df(a)(x — a) in the definition of h should be interpreted as the 
product of the m x n matrix Df(a) and the n x 1 column matrix 


xX} — ay 
X2 — a2 
La | 


Because of the consistency of our definitions, the following results should 
not surprise you: 
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THEOREM 3.9 If f: X C R” — R” is differentiable at a, then it is continuous 
at a. 


THEOREM 3.10 If f:X c R” > R” is such that, for i=1,...,m and 
j=1,...,n, all 0f;/dx; exist and are continuous in a neighborhood of a in 
X, then f is differentiable at a. 


THEOREM 3.11 A function f: X C R” — R” is differentiable at a € X (in the 
sense of Definition 3.8) if and only if each of its component functions f;: X C 
R” > R,i = 1,...,m, is differentiable at a (in the sense of Definition 3.7). 


The proofs of Theorems 3.9, 3.10, and 3.11 are provided in the addendum 
to this section. Note that Theorems 3.10 and 3.11 frequently make it a straight- 
forward matter to check that a function is differentiable: Just look at the partial 
derivatives of the component functions and verify that they are continuous. Thus, 
in many—but not all—circumstances, we can avoid working directly with the 
limit in Definition 3.8. 


EXAMPLE 10 The function g: R? — {(0, 0, 0)} —> R? given by 


3 
g(x, Y, z) = (sa xz) 


has 
—6x —6y —6z 
Gy ter eee ye? 
Dg(x, y, Zz) = 
y x 0 
Zz 0 x 


Each of the entries of this matrix is continuous over R? — {(0, 0, 0)}. Hence, by 
Theorem 3.10, g is differentiable over its entire domain. + 


What Is a Derivative? - 


Although we have defined quite carefully what it means for a function to be 
differentiable, the derivative itself has really taken a “backseat” in the preceding 
discussion. It is time to get some perspective on the concept of the derivative. 

In the case of a (differentiable) scalar-valued function of a single variable, 
f:X C R — R, the derivative f'(a) is simply a real number, the slope of the 
tangent line to the graph of f at the point (a, f(a)). From a more sophisticated 
(and slightly less geometric) point of view, the derivative f'(a) is the number such 
that the function 


A(x) = f(a) + f'a — a) 


is a good linear approximation to f(x) for x near a. (And, of course, y = h(x) is 
the equation of the tangent line.) 

If a function f: X C R” > R of n variables is differentiable, there must 
exist n partial derivatives 0f/0x1,..., 0f/0Xn. These partial derivatives form the 
components of the gradient vector V f (or the entries of the 1 x n matrix Df). It 
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is the gradient that should properly be considered to be the derivative of f, but in 
the following sense: V f(a) is the vector such that the function h: R” —> R given 
by 


h(x) = f(a) + V f(a): (x— a) 


is a good linear approximation to f(x) for x near a. Finally, the derivative of a 
differentiable vector-valued function f: X C R” —> R” may be taken to be the 
matrix Df of partial derivatives, but in the sense that the function h: R” > R” 
given by 


h(x) = f(a) + Df(a)(x — a) 


is a good linear approximation to f(x) near a. You should view the derivative 
Df(a) not as a “static” matrix of numbers, but rather as a matrix that defines a 
linear mapping from R” to R”. (See Example 5 of $1.6.) This is embodied in 
the limit equation of Definition 3.8 and, though a subtle idea, is truly the heart of 
differential calculus of several variables. 

In fact, we could have approached our discussion of differentiability much 
more abstractly right from the beginning. We could have defined a function f: X C 
R” — R” to be differentiable at a point a € X to mean that there exists some 
linear mapping L: R” — R” such that 

lim © [fl@) + Le — adil _ 
im = 0; 


oa Ix — all 


Recall that any linear mapping L: R” — R” is really nothing more than multipli- 
cation by a suitable m x n matrix A (i.e., that L(y) = Ay). It is possible to show 
that if there is a linear mapping that satisfies the aforementioned limit equation, 
then the matrix A that defines it is both uniquely determined and is precisely the 
matrix of partial derivatives Df(a). (See Exercises 60—62 where these facts are 
proved.) However, to begin with such a definition, though equivalent to Definition 
3.8, strikes us as less well motivated than the approach we have taken. Hence, we 
have presented the notions of differentiability and the derivative from what we 
hope is a somewhat more concrete and geometric perspective. 


Addendum: Proofs of Theorems 3.9, 3.10, and 3.11 


Proof of Theorem 3.9 We begin by claiming the following: Let x € R” and 
B = (b;;) be an m x n matrix. If y = Bx, (so y € R”), then 


Iyl < KIx|, (9) 


1/2 
where K = (= j p?) . We postpone the proof of (9) until we establish the 
main theorem. 

To show that f is continuous at a, we will show that ||f(x) — f(a)|| —> 0 as 
x — a. We do so by using the fact that f is differentiable at a (Definition 3.8). 

We have 

|| f(x) — f(a)|| = ||f(x) — f(a) — Df(a)(x — a) + Df(a)(x — a)l] 
< ||f(x) — f(a) — Dfa) — a)|| + || Dax — all, (10) 


using the triangle inequality. Note that the first term in the right side of inequality 
(10) is the numerator of the limit expression in Definition 3.8. Thus, since f is 
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differentiable at a, we can make ||f(x) — f(a) — Df(a)(x — a)|| as small as we wish 
by keeping ||x — all appropriately small. In particular, 


|| f(x) — f(a) — Df(a)(x — a)l] < IIx — all 


if ||x — al] is sufficiently small. To the second term in the right side of inequality 
(10), we may apply (9), since Df(a) is an m x n matrix. Therefore, we see that if 
||x — al] is made sufficiently small, 


\|f(x) — f(a)|| < |x — al] + K||x — all = (1+ K)||x — all. (11) 
The constant K does not depend on x. Thus, as x — a, we have 
I|f(x) — f(a)|| > 0, 


as desired. 
To complete the proof, we establish inequality (9). Writing out the matrix 
multiplication, 


biixı + bi2x2 +--+ + Dinxn bi -x 
by1x1 + b22x2 + +++ + DonXn b2 +x 

y = Bx= l =|, 
bDmixı T bm2X2 a DnnXn Din “xX 


where b; denotes the ith row of B, considered as a vector in R”. Therefore, using 
the Cauchy—Schwarz inequality, 


llyll = ((b . x) + (b2 “xy Ae aden? (Dn <P)" 


1/2 
< (Ibix? + lo |)? x]? + = + lbn Px) 


= (lb? + [Nba l|? + ==» + [bmll?) 7 x. 
Now, 
IIb; |? = Bj, + Big +--+ +57, = yi. 
j=l 
Consequently, 
Ibil? + b21? + ++ + + lbn l? = 5 IIb; ||? = s S =K. 
i=1 i=l j=l 

Thus, |ly|| < K||x|], and we have completed the proof of Theorem 3.9. E 


Proof of Theorem 3.10 First, we prove Theorem 3.10 for the case where f is a 
scalar-valued function of two variables. We begin by writing 


f1, x2) — f(a, a2) = f(x1, x2) — f(a, x2) + f(a, x2) — f(a, a2). 


By the mean value theorem,” there exists a number c; between a; and xı such 
that 


fœ, x2) — f(a, x2) = fa (C1, x2)(%1 — a1) 


2 Recall that the mean value theorem says that if F is continuous on the closed interval [a, b] and differen- 
tiable on the open interval (a, b), then there is a number c in (a, b) such that F(b) — F(a) = F'(c)(b — a). 


130 Chapter 2 | Differentiation in Several Variables 


and a number cz between a2 and x2 such that 


f(a, x2) — f(a, a2) = fo (a1, €2)(%2 — a2). 


(This works because in each case we hold all the variables in f constant except 
one, so that the mean value theorem applies.) Hence, 


| fx, x2) — f(a, a2) — fa (a1, a2) — a1) — fola, a2)(X2 — a )| 
= | fa (c1, X2)&1 — a1) + folar, C2)(X2 — a2) — fx (a1, a2)(X1 — a1) 
— fx (a1, a2)(x2 — a2)| 


< [Fa (Cr, x) — a1) — fr (a1, a) — a )| 


’ 


+ | folar, €2)(X2 — a2) — falai, a2)(%2 — a) 
by the triangle inequality. Hence, 


| f(x, x2) — far, a2) — fe (ai, a2)(%1 — a1) — folar, 2)(%2 — a2)| 
< | Fa (c1, ¥2) — fa (a1, @2)| x1 — | 
+ | f(a, €2) — fr (a1, @2)| [x2 — al 


< {| falci, x2) — fa (ai, @)| + | f(a, c2) — falai, a2)|} IIx — all, 
since, for i = 1, 2, 


lx; — a;l < lx — all = (œ — a1}? + — a)”. 


Thus, 
|f@1, x2) — f(a, a2) — fx (Qi, a2)@1 — a1) — fe (a1, a2)(x2 — a )| 
I|x — all 
< | fe (C1, x2) — fa (a1, a2)| + | fe (a1, 2) — folar, a2)). (12) 


As x —> a, we must have that c; —> a;, fori = 1, 2, since c; is between a; and xi. 
Consequently, by the continuity of the partial derivatives, both terms of the right 
side of (12) approach zero. Therefore, 


lim | f(x, x2) — fai, a2) — fe, (a1, x1 — a1) — fa (a1, a2)(%2 — a)| z 


r>a Ix — all 


0 


as desired. 

Exactly the same kind of argument may be used in the case that f is a scalar- 

valued function of n variables—the details are only slightly more involved, so 

we omit them. Granting this, we consider the case of a vector-valued function 
f: R” — R”. According to Definition 3.8, we must show that 

lim |) — fla) — Pia) — adil _ 

im = 


aid Ix — all 


0. (13) 


The component functions of the expression appearing in the numerator may be 
written as 


Gi = fi(x) — fila) — Dfi(a)(x — a), (14) 


where f;,i = 1,...,m, denotes the ith component function of f. (Note that, by 
the cases of Theorem 3.10 already established, each scalar-valued function f; is 
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differentiable.) Now, we consider 


Ifœ - fla) - Dfa) = all _ (G1, G2; .--, Gm 
Ix — all Ix — all 


(G+ Ge 4 a2)" 


lx — all 
(Gilt [Gal +: +: + 1G 
T I|x — all 
= |G] | i [G2] Dorad [Gm] 
lx— all |x — all I|x — all 


As x — a, each term |G;|/||x — a|| —> 0, by definition of G; in equation (14) and 
the differentiability of the component functions f; off. Hence, equation (13) holds 
and f is differentiable at a. (To see that (Gi peep Gy? < [Gil +--+ IGal, 
note that 


MGi + +1Gml) = |GiP H Gnl? 
+2|Gi||G2| + 21G1| [G3] +--+ + 21Gm—11 Gnl 
SIG Ferta 
Then, taking square roots provides the inequality.) E 


Proof of Theorem 3.11 In the final paragraph of the proof of Theorem 3.10, we 
showed that 


I(x) - fla) - DDR -DI IG, IG! Gal 
|x — all ~ |ix—al Ix — all I|x — all’ 
where G; = f;(x) — f;(a) — Df;(a)(x — a) as in equation (14). From this, it fol- 
lows immediately that differentiability of the component functions fi, ..., fin at 
a implies differentiability of f at a. Conversely, fori = 1,...,m, 
Ifœ — fa) - Daya -a)l _ |(Gi, G2,- - -> Gm)ll 5 _IG;l 
|x — all |x — all ~ lx- all 


Hence, differentiability of f at a forces differentiability of each component 
function. m 


In Exercises 1—9, calculate 0f/0x and of /dy. 6. f(x, y) = In(x? + y?) 
1. f(x,y) = xy? +37y 
2. fle, y) =e" 


3. f(x, y) = sinxy + cosxy 


4. f(x, y)= 


5. f(x,y) = 


xe y? 


1+ x? + 3y4 


wy? 


x2 + y? 


7. f(x, y) = cosx?y 
8. fæ») =I (=) 
y 


9. f(x, y) = xe” + ysin(x? + y) 


In Exercises 10-17, evaluate the partial derivatives 0F /0x, 
dF /dy, and 0F /0z for the given functions F. 


10. F(x, y,z)=x+3y —2z 
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11. F(x, y, 2) = —— 
. F(x, y, z) = — 
2 y+z 


12. F(x, y, z) = xyz 


13. F(x, y, z) = Vx? +y? +22 

14. F(x, y, z) = e™ cos by + e% sin bx 
X+y+z 

ad + x? + y2 + 22)3/2 


16. F(x, y, z) = sinx*y>z* 


15. F(x, y,z)= 


x3 + yz 
a +l 
Find the gradient V f(a), where f and a are given in Exer- 
cises 18-25. 


17. F(x, y,Z= 


18. f(x,y) =x? y +e, a=(1,0) 
=y 
19. y= A =(2,—1 
fæ») = zry 2=@-D 
20. f(x, y,z)=sinxyz, a= (x, 0,7/2) 
21. f(x,y,z) = xy + y cosz — x sin yz, 
a = (2, —1, 7) 
22. f(x, y)=e? +ln(x— y) a= (2,1) 
E E EEE E E) 
ez 
24. f(x,y,z) = coszln(x + y), a= (e, 0,7/4) 
E 2 (1,2,1) 
. xX, YyY,z) = aia SH LA 
OS py zZ+I 


In Exercises 26—33, find the matrix Df(a) of partial derivatives, 
where f and a are as indicated. 


26. f=, a = (3, 2) 


27. f(x, y,z)=x?+xIn(yz), a= (—3,e,e) 
28. f(x, y,z) = (2x — 3y + 5z, x? + y, In (yz)), 


a = (3, —1, —2) 
29. f(x, y,z)= (xyz, Vx? ty? + z): 
a = (1, 0, —2) 


30. f(t) = (t, cos2t, sin 5t), a=0 
31. f(x, y, z, w) = (3x — 7y + z, 5x + 2z — 8w, 
y—17z+3w), a= (1,2,3,4) 


32. f(x, y) = (x?y, x + y?, cos m xy), 
33. f(s, t) = (s?, st, t°), a= (—1,1) 


a= (Q2, —1) 


Explain why each of the functions given in Exercises 34—36 is 
differentiable at every point in its domain. 

34. f(x, y) = xy — 7x8y? + cosx 
X+y+zZz 


35. f(x,y,z) = ————— 
F(x, y, 2) Pape 


2, 
36. f(x, y) = (= T z) 


x24+y4’? y x 
37. (a) Explain why the graph of z = x° — 7xy + e” has 
a tangent plane at (—1, 0, 0). 
(b) Give an equation for this tangent plane. 


38. Find an equation for the plane tangent to the graph of 
z =4cosxy at the point (7/3, 1, 2). 


39. Find an equation for the plane tangent to the graph of 


z = e*t cos xy at the point (0, 1, e). 


40. Find equations for the planes tangent to z= 
x?—6x+y> that are parallel to the plane 
4x —12y+z=7. 


41. Use formula (8) to find an equation for the hy- 
perplane tangent to the 4-dimensional paraboloid 
xs = 10 — (x? + 3x3 + 2x3 +x?) at the point 
(2,—1, 1,3, —8). 

42. Suppose that you have the following information con- 
cerning a differentiable function f: 

f(2,3)=12, f(1.98,3)= 12.1, f(2,3.01) = 12.2. 
(a) Give an approximate equation for the plane tangent 

to the graph of f at (2, 3, 12). 
(b) Use the result of part (a) to estimate f(1.98, 2.98). 
In Exercises 43—45, (a) use the linear function h(x) in Def- 
inition 3.8 to approximate the indicated value of the given 
function f. (b) How accurate is the approximation determined 
in part (a)? 

43. f(x, y) = e", f(0.1, —0.1) 

44. f(x, y)=3 + coszrxy, f(0.98, 0.51) 

45. f(x, y, z) = x? + xyz + y>z, f(1.01, 1.95, 2.2) 

46. Calculate the partial derivatives of 


Xp txt ee + Xn 


f1, X2, +++) Xn) = . 
(Pramu 
47. Let 
Zð T 
xy =x yt ey 

— if (x, y) # (0,0) 
f@,y)= ory 
0 if (x, y) = (0, 0) 


(a) Calculate df/dx and df/dy for (x, y) Æ (0, 0). 
(You may wish to use a computer algebra system 
for this part.) 


(b) Find f,(0, 0) and f,(0, 0). 


As mentioned in the text, if a function F(x) of a single variable 
is differentiable at a, then, as we zoom in on the point (a, F(a)), 
the graph of y = F(x) will “straighten out” and look like its 
tangent line at (a, F(a)). For the differentiable functions given 


in Exercises 48-51, (a) calculate the tangent line at the indi- 
cated point, and (b) use a computer to graph the function and 
the tangent line on the same set of axes. Zoom in on the point 
of tangency to illustrate how the graph of y = F(x) looks like 
its tangent line near (a, F(a)). 


Q 48. F(x) = - 2x43, al 
Q 49. F(x) =x + sinx, ri 
x? —3x2+x 
50. F(x) = , a=0 
Q 50. F@)=— >. 2 
Q 51. F(x) =n? +), a=-1 


@ 52. 


(a) Use a computer to graph the function F(x) = 
(x — 2p, 
(b) By zooming in near x = 2, offer a geometric dis- 


cussion concerning the differentiability of F at 
x =2., 


As discussed in the text, a function f(x, y) may have partial 
derivatives f(a, b) and f,(a, b) yet fail to be differentiable at 
(a, b). Geometrically, if a function f(x, y) is differentiable at 
(a, b), then, as we zoom in on the point (a, b, f(a, b)), the graph 
ofz = f(x, y) will “flatten out” and look like the plane given 
by equation (4) in this section. For the functions f(x, y) given 
in Exercises 53—57, (a) calculate f,(a,b) and f(a, b) at the 
indicated point (a, b) and write the equation for the plane given 
by formula (4) of this section, (b) use a computer to graph the 
equation z = f(x, y) together with the plane calculated in part 
(a). Zoom in near the point (a, b, f (a, b)) and discuss whether 
or not f(x, y) is differentiable at (a, b). (c) Give an analytic 
(i.e., nongraphical) argument for your answer in part (b). 


Q 53. f@ y)=x° -xy +y’, @d)=2,) 
Q 54. fæ y) = (œ -1 , (a,b) = (1,0) 
xy 
Q 55. f(x,y) = Payal’ (a, b) = (0, 0) 


m 3m 
aa) 
@ 57. f(x,y) =x siny +y cosx, (a,b) = (5 a) 


58. Let g(x, y) = xy. 
(a) Is g continuous at (0, 0)? 
(b) Calculate 0g/dx and 0g/dy when xy Æ 0. 
(c) Show that g,(0, 0) and g,(0, 0) exist by supplying 
values for them. 
(d) Are dg/dx and dg/dy continuous at (0, 0)? 
(e) Does the graph ofz = g(x, y) have a tangent plane 


at (0, 0)? You might consider creating a graph of 
this surface. 


(f) Is g differentiable at (0, 0)? 


@ 56. f(x,y) =sinxcosy, (a,b) = ( 
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59. Suppose f: R” — R” is a linear mapping; that is, 


f(x) = Ax, where x = (x1, X2, ..., Xn) € R” 


and A is an m x n matrix. Calculate Df(x) and relate 
your result to the derivative of the one-variable linear 
function f(x) = ax. 


In Exercises 60—62 you will establish that the matrix Df(a) of 
partial derivatives of the component functions of f is uniquely 
determined by the limit equation in Definition 3.8. 


60. Let X be an open set in R”, let a € X, and let F: X C 
R” — R”. Show that 


lim ||F(x)|| =0 <> lim F(x) = 0. 


61. Let X be an open set in R”, let a € X, and let f: X C 
R” — R”. Suppose that A and B are m x n matrices 


such that 
Iœ- [f(a) + AC a)] | 
m = 
xa |x — all 
li Ifœ@ — [f(a) + Be -a]l 
im =; 
x>a |x — all 


(a) Use Exercise 60 to show that 


(B= A)(x—a) _ 


x>a_||x — all 


(b) Write x — a as th, where h is a nonzero vector in 
R”. First argue that 


(B— A\x—a) | 


lim 0 implies 
sa ix—al 
. (B — A)(th) 
lim ———— = 0, 
10 [th 


and then use this result to conclude that A = B. 
(Hint: Break into cases where t > 0 and where 
t <0.) 


62. Let X be an open set in R”, let a € X, and let f: X C 
R” — R”. Suppose that A is an m x n matrix such 
that 

ji Ifœ@ — [f(a) + A& -alll 
im = 


x8 lx — all 


0. 


In this problem you will establish that A = Df(a). 
(a) Define F: X C R” > R” by 
_ f(x) — f(a) — A(x — a) 

|x — all l 


F(x) 


Identify the ith component function F;(x) us- 
ing component functions of f and parts of the 
matrix A. 


(b) Note that under the assumptions of this problem 
and Exercise 60, we have that limy_,, F(x) = 0. 
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First argue that, for i=1,...,m, we have a Of; 

limy+a F;(x) = 0. Next, argue that (c) Use parts (a) and (b) to show that a;; = ax; 
: vanl; : here a;; denotes the i jth entry of A. (Hint: Break 
(x) = lim F;(a + he;) = 0, mies: 
m PA= implies ho ee into cases where h > 0 and where h < 0.) 


where ej denotes the standard basis vector 
(0,...,1,..., 0) for R”. 


2.4 Properties; Higher-order Partial 
Derivatives 


Properties of the Derivative 


From our work in the previous section, we know that the derivative of a function 
f: X C R” — R” can be identified with its matrix of partial derivatives. We next 
note several properties that the derivative must satisfy. The proofs of these results 
involve Definition 3.8 of the derivative, properties of ordinary differentiation, and 
matrix algebra. 


PROPOSITION 4.1 (LINEARITY OF DIFFERENTIATION) Let f, g: X C R” — R” 
be two functions that are both differentiable at a point a € X, and let c € R be 
any scalar. Then 


1. The function h = f + g is also differentiable at a, and we have 
Dh(a) = D(f + g)(a) = Df(a) + Dg(a). 
2. The function k = cf is differentiable at a and 


Dk(a) = D(cf)(a) = cDf(a). 


EXAMPLE 1 Let f and g be defined by f(x, y) = (x + y, xy sin y, y/x) and 
g(x, y) = (x? + y?, ye*”, 2x3 — 7y). We have 


1 1 
Df(x,y)=| ysiny xsiny+xycosy 
—y/x? 1/x 
and 
2x 2y 
Dax, y)= | Pe et xye? 
6x? —35y4 


Thus, by Theorem 3.10, fis differentiable on R? — {y-axis} and g is differentiable 
on all of R’. If we let h = f + g, then part 1 of Proposition 4.1 tells us that h must 
be differentiable on all of its domain, and 


Dh(x, y) = Df(x, y) + Dg(x, y) 
2x +1 2y+1 


=| ysiny+y’e” xsiny+xycosy +e” +xyer 
6x? — y/x* 1/x — 35y 
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Note also that the function k = 3g must be differentiable everywhere by part 2 
of Proposition 4.1. We can readily check that Dk(x, y) = 3Dg(x, y): We have 


k(x, y) = (3x7 + 3y’, 3ye”, 6x? — 21y°). 


Hence, 
6x 6y 
Dk(x, y)= | 3y’e*” 3e” + 3xye*? 
18x? —105y* 
2x 2y 
= 3 ye e” + xye 
6x? =15;" 
= 3Dg(x, y). + 


Due to the nature of matrix multiplication, general versions of the product 
and quotient rules do not exist in any particularly simple form. However, for 
scalar-valued functions, it is possible to prove the following: 


PROPOSITION 4.2 Let f, g: X C R” — R be differentiable at a € X. Then 
1. The product function fg is also differentiable at a, and 
D(f8)Xa) = 8(a)Df (a) + f(a)Dg(a). 
2. If g(a) Æ 0, then the quotient function f/g is differentiable at a, and 
gD fa) — f@Ds(a) 


D(f/g)(a) = a? 


EXAMPLE 2 If f(x, y, z) = ze”? and g(x, y, z) = xy + 2yz — xz, then 


(fg)(x, y, Z) = (xyz + 2yz? = xzye”, 
so that 


xz7)yer 
xz? xe? 


(yz — 27)e*” + (xyz + 2yz? 
(xz + 227)e*” + (xyz + 2yz* 
(xy + 4yz — 2xz)e*” 


D(fg)Xx, y, z) = 


Also, we have 
Df(x, y, z) = lyse" zee? e] 
and 
Dex, y,z)=[y-z x+2z 2y—x], 
so that 
ge, y, Z)Df (x, y, 2+ F(x, y, DDa(x, y, z) 
(yz — ze 


(xz + 2z7)e*? 
(2yz — xz)e*” 


(xyz + 2y?z? — xyz?) 
=| (yz + 2xyz — x?” | + 
(xy + 2yz — xz)e” 
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T 
xy*z + 2y?2? — xyz? + yz — z? 
=e | x?yz+2xyz? — x? + xz +22 | , 
xy +4yz — 2xz 


which checks with part 1 of Proposition 4.2. (Note: The matrix transpose is used 
simply to conserve space on the page.) + 


The product rule in part 1 of Proposition 4.2 is not the most general 
result possible. Indeed, if f: X C R” — R is a scalar-valued function and 
g: X C R” — R” is a vector-valued function, then if f and g are both differ- 
entiable ata € X, so is fg, and the following formula holds (where we view g(a) 
asan m x | matrix): 


D(fg)a) = ga)Df (a) + f(a)Dgia). 


Partial Derivatives of Higher Order 


Thus far in our study of differentiation, we have been concerned only with partial 
derivatives of first order. Nonetheless, it is easy to imagine computing second- 
and third-order partials by iterating the process of differentiating with respect to 
one variable, while all others are held constant. 


EXAMPLE 3 Let f(x, y, z) = x?y + y?z. Then the first-order partial deriva- 
tives are 


af» af 2 
—=2xy, >= 2yz, and = = y^. 
Ta a +2yz T 
The second-order partial derivative with respect to x, denoted by 07 f/x? or 


Fax (Xs y, z), is 


af ə Ẹ 


0 
əx? ax 1) ax aaa 


Similarly, the second-order partials with respect to y and z are, respectively, 


2 
2 8 (=) - ee | 2yz) = 2z, 


dy? = dy \dy dy 
and 
vf a af) a5 
əz? əz (= ag” ) 


There are more second-order partials, however. The mixed partial derivative 
with respect to first x and then y, denoted 3? f/dyx or fry(x, y, z), is 


rf ə E 
ðyðx dy \ax 


a 
) = ay or»? = 2%. 


There are five more mixed partials for this particular function: 3? f/dxdy, 
Ə? f/dzdx, Ə? f/dxdz, Ə? f/dzdy, and 3? f/dydz. Compute each of them to get 
a feeling for the process. + 


In general, if f: X C R” — R isa (scalar-valued) function ofn variables, the 
kth-order partial derivative with respect to the variables x;,, x;,,..., X (in that 
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order), where i1, i2, . . . , ig are integers in the set {1, 2, .. . , n} (possibly repeated), 
is the iterated derivative 
oF a a 


— = i CF On apnea te: 


OXi,: xe OX, 0Xi, OXi, OXi, OXi, 


Equivalent (and frequently more manageable) notation for this kth-order partial 
is 


Seis xig ig (xı 2X2, +065 Xn). 


Note that the order in which we write the variables with respect to which we 
differentiate is different in the two notations: In the subscript notation, we write 
the differentiation variables from left to right in the order we differentiate, while 
in the d-notation, we write those variables in the opposite order (i.e., from right 
to left). 


EXAMPLE 4 Let f(x, y, z, w) = xyz + xy?w — cos(x + zw). We then have 


ə? ð ə 
Syw(x, y, z, w) = f = (xyz + xy*w — cos(x + zw)) 
ðwðy dwdoy 


a 
= — (xz + 2xyw) = 2xy, 
dw 


and 


a? ə ə 
foyr n W) = f = (xyz + xy*w — cos(x + zw)) 
ý ðyðw ðyðw 


ð . 
= —(xy? + z sin(x + zw)) = 2xy. 
dy + 


Although it is generally ill-advised to formulate conjectures based on a single 
piece of evidence, Example 4 suggests that there might be an outrageously simple 
relationship among the mixed second partials. Indeed, such is the case, as the next 
result, due to the 18th-century French mathematician Alexis Clairaut, indicates. 


THEOREM 4.3 Suppose that X is open in R” and f: X C R” — R has con- 
tinuous first- and second-order partial derivatives. Then the order in which we 
evaluate the mixed second-order partials is immaterial; that is, if i; and i, are any 
two integers between | and n, then 


af af 


Əxi OX, əxi, OXi, 


A proof of Theorem 4.3 is provided in the addendum to this section. We 
also suggest a second proof (using integrals!) in Exercise 4 of the Miscellaneous 
Exercises for Chapter 5. 

It is natural to speculate about the possibility of an analogue to Theorem 4.3 
for kth-order mixed partials. Before we state what should be an easily anticipated 
result, we need some terminology. 
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DEFINITION 4.4 Assume X is open in R”. A scalar-valued function 
f:X CR" — R whose partial derivatives up to (and including) order at 
least k exist and are continuous on X is said to be of class C*. If f has 
continuous partial derivatives of all orders on X, then f is said to be of class 
C™, or smooth. A vector-valued function f: X C R” > R” is of class C* 
(respectively, of class C™) if and only if each of its component functions is 
of class C* (respectively, C°). 


THEOREM 4.5 Let f:X C R” > R be a scalar-valued function of class C*. 
Then the order in which we calculate any kth-order partial derivative does not 


matter: If (ij, ..., ig) are any k integers (not necessarily distinct) between 1 and 
n, and if (ji, ..., je) is any permutation (rearrangement) of these integers, then 
ak f Ə Ff 
əxi: ¿4 OXi, T OX j,* a OX jy, i 


EXAMPLE 5 If f(x, y, z, w) = x? we”? — ze*’ + xyzw, then you can check 


that 
35 35 
eed = 2e” (yz + 1) = f ; 
dxdwdzdydx dzdydwd2x 
verifying Theorem 4.5 in this case. + 


Addendum: Two Technical Proofs 


Proof of Part 1 of Proposition 4.1 


Step 1. We show that the matrix of partial derivatives of h is the sum of 
those of f and g. If we write h(x) as (h1 (x), f2(x), . . . , Am(X)) (i.e., in terms of 
its component functions), then the i jth entry of Dh(a) is 0h; /dx; evaluated at a. 
But h;(x) = fix) + g;(x) by definition of h. Hence, 


ðh; _ Of; + 08; 
ax; ð ax;  Ox;’ 


by properties of ordinary differentiation (since all variables except x; are held 
constant). Thus, 


® (f(a) + 2:(x)) = 
Xj 


dh; dfi gi 
5 (a) = 5) + TAO] 
Xj OX; OX; 


and, therefore, 
Dh(a) = Df(a) + Dg(a). 


Step 2. Now that we know the desired matrix of partials exists, we must 
show that h really is differentiable; that is, we must establish that 


sin UX) = he) + Dha = ah 
ım = 
wa Ix—al 


0. 


(a,b+Ay) (a+Ax,b+Ay) 
e e 


- + 


+ — 


Ge (a+ hes b) 


Figure 2.55 To construct the 
difference function D used in the 
proof of Theorem 4.3, evaluate f 
at the four points shown with the 
signs as indicated. 
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As preliminary background, we note that 


|| h(x) — [h(a) + Dh(a)(x — a)]|| 


IIx —al 
_ If) + gŒ) — [fla) + ga) + Dfa) — a) + Dga) — a)]| 
IIx — al 
_ IEW = [fa) + flax — a)]) + (EŒ — [g(a) + Pga) — a) 
IIx — al 
|f(x) — [f(a) + DAE- a)]||__ gŒ — [g(a) + Dg(a)(x — a)l 
7 IIx — all Ix —al i 


by the triangle inequality, formula (2) of §1.6. To show that the desired limit 
equation for h follows from the definition of the limit, we must show that given 
any € > 0, we can find a number ô > 0 such that 

||h(x) — [h(a) + Dh(a)(x — a)]| 


if 0 < ||x — all < ô, then <€. (1) 
|x — all 


Since f is given to be differentiable at a, this means that given any €; > 0, we can 
find 6; > 0 such that 


Ix) — fa) + D -DI 
Ix — al 


(2) 


if 0 < ||x —al| < ô, then 


Similarly, differentiability of g means that given any €2 > 0, we can find a ô2 > 0 
such that 
Ig&) — [g(a) + D(a) — all 

<E 


if 0 < ||x —al| < ô2, then 
lx — all 


(3) 


Now we're ready to establish statement (1). Suppose € > 0 is given. Let 6, 
and 55 be such that (2) and (3) hold with €; = €2 = €/2. Take ô to be the smaller 
of 5; and 52. Hence, if 0 < ||x — al] < 6, then both statements (2) and (3) hold 
(with €; = €2 = €/2) and, moreover, 


hœ) — [h(a) + Dh(a)(x — a)] || f(x) — [f(a) + Df(a)(x — a)]]|| 


l 


I|x — all - lx — all 
_ llg@) — [g(a) + Dga) — alll 
| IIx —al 
<E + € 
€ € 
= 2 + 7 =€. 
That is, statement (1) holds, as desired. a 


Proof of Theorem 4.3 For simplicity of notation only, we'll assume that f is a 
function of just two variables (x and y). Let the point (a, b) € R? be in the interior 
of some rectangle on which fx, fy, fix, Ary, fry, and fyx are all continuous. 
Consider the following “difference function.” (See Figure 2.55.) 


D(Ax, Ay) = f(a + Ax, b + Ay) — f(a + Ax, b) 
f(a, b+ Ay) + f(a, b). 
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(a,b+Ay) (a+Ax,b+ Ay) 


R 


(cd) x 


(a, b) (a + Ax, b) 


Figure 2.56 Applying the mean 
value theorem twice. 


Our proof depends upon viewing this function in two ways. We first regard D as 
a difference of vertical differences in f: 


D(Ax, Ay) = [f(a + Ax, b+ Ay) — f(a + Ax, b)] 


—[fla, b + Ay) — f(a, b)] 
= F(a + Ax) — F(a). 


Here we define the one-variable function F(x) to be f(x, b + Ay) — f(x, b). As 
we will see, the mixed second partial of f can be found from two applications of 
the mean value theorem of one-variable calculus. Since f has continuous partials, 
itis differentiable. (See Theorem 3.10.) Hence, F is continuous and differentiable, 
and, thus, the mean value theorem implies that there is some number c between 
a anda + Ax such that 


D(Ax, Ay) = F(a + Ax) — F(a) = F'(c)Ax. (4) 


Now F'(c) = fi(c, b + Ay) — f(c, b). We again apply the mean value theorem, 
this time to the function f,(c, y). (Here, we think of c as constant and y as the 
variable.) By hypothesis fy is differentiable since its partial derivatives, fxx and 
fry, are assumed to be continuous. Consequently, the mean value theorem applies 
to give us a number d between b and b + Ay such that 


F'(c) = f(c, b + Ay) — file, b) = fay(c, DAY. (5) 
Using equation (5) in equation (4), we have 
D(Ax, Ay) = F'(c)Ax = fale d)AyAx. 


The point (c, d) lies somewhere in the interior of the rectangle R with vertices 
(a, b), (a+ Ax, b), (a, b + Ay), (a+ Ax, b + Ay), as shown in Figure 2.56. 
Thus, as (Ax, Ay) —> (0, 0), we have (c, d) —> (a, b). Hence, it follows that 


fxy(C, d) = fey(a, b) as (Ax, Ay) = (0, 0), 
since f,, is assumed to be continuous. Therefore, 
fola, b) = fayle.d) = Pes OY) 
xy ’ x lim aa or Cee 
aN (Ax un (0,0) 272 (Ax,Ay)+(0,0) AyAx 


On the other hand, we could just as well have written D as a difference of 
horizontal differences in f: 


D(Ax, Ay) = [f(a + Ax, b + Ay) — f(a, b+ Ay)] 
—[f(a+ Ax, b)— f(a, )] 
= G(b + Ay) — G(b). 


Here G(y) = f(a + Ax, y)— f(a, y). As before, we can apply the mean value 
theorem twice to find that there must be another point (¢, d) in R such that 


D(Ax, Ay) = G’(d)Ay = fyx(@, d)AxAy. 


Therefore, 


fyrla, b) = PEDE = 
(a, „(€ d) = ———_— 
: (Ax, 1 i; 0? (Ax, mi g 0) AxAy 


Because this is the same limit as that for f(a, b) just given, we have established 
the desired result. E 


2.4 Exercises 


In Exercises 1—4, verify the sum rule for derivative matrices 
(i.e., part I of Proposition 4.1) for each of the given pairs of 
functions: 
1. f(x, y)=xy+cosx, g(x, y)= sin(xy) + y? 
2. f(x, y) = (e, xe”), g(x, y) = (In(xy), ye”) 
3. f(x, y, z) = (xsin y +z, ye? — 3x’), g(x, y,z)= 
(x? cos x, xyz) 


4. f(x, y, z) = (xyz?, xe, ysinxz), g(x, y, z) = 
(x -y +y +27, naæz+ 2) 


Verify the product and quotient rules (Proposition 4.2) for the 
pairs of functions given in Exercises 5—8. 


5. fœ, y)=xXy +y, gx, y= 


se 


6. f(x,y) =e”, g(x,y) =xsin2y 
7. f(x, y)=3xy +y, g(x, y) = x? —2xy? 
8. f(x,y,z) = x cos (yz), 

g(x,y, z) =x? + xy +yz +2 


For the functions given in Exercises 9-17 determine all second- 
order partial derivatives (including mixed partials). 


9. f(x,y) = xy! + 3xy? — Ixy 
10. f(x, y) =cos(xy) 
11. f(x, y) = 2 — ye* 


12. f(x, y) = sin yx? + y? 


13. f(x, y) = — 
fay) sin’ x + 2e 


14. f(x, y) = e 


15. f(x,y) = ysinx — x cos y 
16. fæ» =mn(*) 
y 


17. f(x, y) = x2 + e% 
18. f(x, y,2) = —— 
+ f(x,y,z) = —— 
5 y+tz 
19. f(x, y, z) = x? yz + xy?z + xyz? 
20. f(x, y, z) = e7 
21. f(x, y, z) =e“ sin y +e cosz 
22. Consider the function F(x, y, z) = 2x°y + xz? + 
y?z? — 7xyz. 
(a) Find Fy, Fyy, and F;z. 


(b) Calculate the mixed second-order partials F,y, 
Fyz, Fezi Fas Fyz and Fry, and verify Theorem 
4.3. 


23. 


24. 


25. 


26. 


27. 


28. 
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(c) Is Fyy, = Fyxy? Could you have known this with- 
out resorting to calculation? 


(d) Is Fyz = Fyzx? 


Let f(x, y)= ye™. Give general formulas for 
a” f/ðx” and a” f/dy", where n > 2. 


Let f(x, y, z) = xe?” + ye +ze™. Give general 
formulas for 3” f/dx", Ə” f/dy", and 0” f/dz", where 
n> 1. 


Let f(x, y,z)=In (=). Give general formulas for 


z 
a” f/ðx", Ə” f/dy", and a” f/dz", where n > 1. What 
can you say about the mixed partial derivatives? 


Let f(x, y, z) =x’ y?z3 — 2x*yz. 
(a) What is 34 f/3x?3y3z? 

(b) What is 0° f/dx*dydz? 

(c) What is 85 f/x? dydz? 


Recall from §2.2 that a polynomial in two variables x 
and y is an expression of the form 


d 


P(X, y)= 5 cux*y', 


k,l=0 


where cx can be any real number for 0 < k,l < d. The 
degree of the term cj;x" y! when cy 4 0 is k + l and 
the degree of the polynomial p is the largest degree 
of any nonzero term of the polynomial (i.e., the largest 
degree of any term for which cy 4 0). For example, 
the polynomial 


D(x, y)= Tx°y? + 2x73 —3xt- 5xy? +1 


has five terms of degrees 15, 5, 4, 4, and 0. The de- 

gree of p is therefore 15. (Note: The degree of the zero 

polynomial p(x, y) = 0 is undefined.) 

(a) If p(x, y) = 8x7y!? — 9x?y + 2x, what is the de- 
gree of dp/dx? dp/dy? Ə? p/ðx?? d*p/ady*? 
a? p/dxdy? 

(b) If p(x, y) = 8x?y + 2x3 y, what is the degree of 
dp/Ix? dp/dy? Ə? p/dx2? a? p/dIy?? Ə p/Ixdy? 

(c) Try to formulate and prove a conjecture relating 
the degree of a polynomial p to the degree of its 
partial derivatives. 


The partial differential equation 


2 2 2 
Pf Pf PF 


=0 
əx? ay? Az? 


is known as Laplace’s equation, after Pierre Simon 
de Laplace (1749-1827). Any function f of class C? 
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that satisfies Laplace’s equation is called a harmonic 

function.’ 

(a) Is f(x, y,z) =x? + y? — 2z? harmonic? What 
about f(x, y, z) =x? — y?+27? 

(b) We may generalize Laplace’s equation to functions 
of n variables as 


af 


2 
Ox; 


ə? a? 
a ae) 
əx 


+ 
Give an example of a harmonic function of n vari- 
ables, and verify that your example is correct. 


The three-dimensional heat equation is the partial dif- 
ferential equation 
aT aT 


n aT 3T 

(= T dy? T z) at? 

where k is a positive constant. It models the tempera- 

ture T(x, y, z, t) at the point (x, y, z) and time ¢ of a 

body in space. 

(a) We examine a simplified version of the heat equa- 
tion. Consider a straight wire “coordinatized” by 
x. Then the temperature T(x, t) at time t and po- 
sition x along the wire is modeled by the one- 
dimensional heat equation 


T ƏT 
ðx? at” 


Show that the function T(x, t) = e~ cos x satis- 
fies this equation. Note that if t is held constant at 
value fo, then T(x, to) shows how the temperature 
varies along the wire at time tọ. Graph the curves 
z = T(x, to) for to = 0, 1, 10, and use them to un- 
derstand the graph of the surface z = T(x, t) for 
t > 0. Explain what happens to the temperature of 
the wire after a long period of time. 


Show that T(x, y, t) = e~“(cos x + cos y) satis- 
fies the two-dimensional heat equation 


ƏT ƏT ƏT 
k + = —, 
ax? dy? ot 


(b) 


2.5 


30. 


Graph the surfaces given by z = T(x, y, fo), where 
to = 0, 1, 10. If we view the function T(x, y, t) as 
modeling the temperature at points (x, y) of a flat 
plate at time t, then describe what happens to the 
temperature ofthe plate after a long period of time. 

(c) Now show that T(x, y,z,t)= e™ (cosx + 
cos y + cos z) satisfies the three-dimensional heat 
equation. 


Let 


x2 + y2 
0 if (x, y) = (0, 0) 


(a) Find fx(x, y)and f,(x, y) for (x, y) Æ (0, 0). (You 
will find a computer algebra system helpful.) 


2 y2 
meas » (555) a OU), 


(b) Either by hand (using limits) or by means of 
part (a), find the partial derivatives f.(0, y) and 
Fy(x, 0). 

(c) Find the values of f,,(0, 0) and fyx(0, 0). Recon- 
cile your answer with Theorem 4.3. 


A surface that has the least surface area among all surfaces 
with a given boundary is called a minimal surface. Soap bub- 
bles are naturally occurring examples of minimal surfaces. It 
is a fact that minimal surfaces having equations of the form 
z= f(x, y) (where f is of class C?) satisfy the partial differ- 
ential equation 


(1+2?) Sex + (1+2) Gy = Deze: (6) 


Exercises 31—33 concern minimal surfaces and equation (6). 


31. 


Q 32. 


Q 33. 


Show that a plane is a minimal surface. 

Scherk’s surface is given by the equation e* cos y = 
cos x. 

(a) Use a computer to graph a portion of this surface. 
(b) Verify that Scherk’s surface is a minimal surface. 

One way to describe the surface known as the helicoid 
is by the equation x = y tan z. 

(a) Use a computer to graph a portion of this surface. 
(b) Verify that the helicoid is a minimal surface. 


The Chain Rule 


Among the various properties that the derivative satisfies, one that stands alone 
in both its usefulness and its subtlety is the derivative’s behavior with respect 
to composition of functions. This behavior is described by a formula known as 


3 Laplace did fundamental and far-reaching work in both mathematical physics and probability theory. 
Laplace’s equation and harmonic functions are part of the field of potential theory, a subject that Laplace 
can be credited as having developed. Potential theory has applications to such areas as gravitation, elec- 
tricity and magnetism, and fluid mechanics, to name a few. 
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the chain rule. In this section, we review the chain rule of one-variable calculus 
and see how it generalizes to the cases of scalar- and vector-valued functions of 
several variables. 


The Chain Rule for Functions of One Variable: A Review — 


We begin with a typical example of the use of the chain rule from single-variable 
calculus. 


EXAMPLE 1 Let f(x) = sinx and x(t) = t° + t. We may then construct the 
composite function f(x(t)) = sin(t? + t). The chain rule tells us how to find the 
derivative of f o x with respect to t: 


(foxy (t) = £ (sin(t + t)) = (cos(t? + tt + 1). 
Since x = £? +1, we have 


d d 
(foxy) = ina): 56 +1) = F'O) xO. + 


In general, suppose X and T are open subsets of R and f: X C R— R 
and x:T C R —> R are functions defined so that the composite function 
f ox:T — R makes sense. (See Figure 2.57.) In particular, this means that the 
range of the function x must be contained in X, the domain of f. The key result 
is the following: 


R R R 


Figure 2.57 The range of the function x must be contained in the domain X of f in 
order for the composite f o x to be defined. 


THEOREM 5.1 (THE CHAIN RULE IN ONE VARIABLE) Under the preceding as- 
sumptions, if x is differentiable at tg € T and f is differentiable at x9 = x(to) € X, 
then the composite f o x is differentiable at tọ and, moreover, 


(f o x) (to) = f'(x0)x'(to). 0) 
A more common way to write the chain rule formula in Theorem 5.1 is 
df df dx 
— (to) = — — (to). 2 
ar © geo gr © (2) 


Although equation (2) is most useful in practice, it does represent an unfor- 
tunate abuse of notation in that the symbol f is used to denote both a function 
of x and one of t. It would be more appropriate to define a new function y by 
y(t) = (f ox)(t) so that dy/dt = (df/dx)(dx /dt). But our original abuse of no- 
tation is actually a convenient one, since it avoids the awkwardness of having too 
many variable names appearing in a single discussion. In the name of simplicity, 
we will therefore continue to commit such abuses and urge you to do likewise. 

The formulas in equations (1) and (2) are so simple that little more needs 
to be said. We elaborate, nonetheless, because this will prove helpful when we 
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generalize to the case of several variables. The chain rule tells us the following: 
To understand how f depends on t, we must know how f depends on the “in- 
termediate variable” x and how this intermediate variable depends on the “final” 
independent variable t. The diagram in Figure 2.58 traces the hierarchy of the 
variable dependences. The “paths” indicate the derivatives involved in the chain 
rule formula. 


Dependent Intermediate Final 
variable variable variable 


df dx 
poe- ý i TN 
f t 
df 
dt 


Figure 2.58 The chain rule for functions of a single variable. 


y 


x Range x f 


R? 


Figure 2.59 The composite function f o x. 


The Chain Rule in Several Variables 


Now let’s go a step further and assume f:X C R? > R is a C! function of 
two variables and x: T C R — R? is a differentiable vector-valued function 
of a single variable. If the range of x is contained in X, then the composite 
fox:T C R — R is defined. (See Figure 2.59.) It’s good to think of x as 
describing a parametrized curve in R? and f as a sort of “temperature func- 
tion” on X. The composite f o x is then nothing more than the restriction of f to 
the curve (i.e., the function that measures the temperature along just the curve). 
The question is, how does f depend on t? We claim the following: 


PROPOSITION 5.2 Suppose x:T C R —> R°? is differentiable at tọ € T, and 
f:X CR’ = R is differentiable at xp = x(t) = (xo, yo) € X, where T and X 
are open in R and R’, respectively, and range x is contained in X. If, in addition, 
f is of class C!, then f ox: T — R is differentiable at to and 


d 0 d 0 d 
Sa) = Hoa) (0) 4 Z). 


Before we prove Proposition 5.2, some remarks are in order. First, notice 
the mixture of ordinary and partial derivatives appearing in the formula for the 


Figure 2.61 The graph of the 
function x of Example 2. 
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Dependent Intermediate Final 
variable variables variable 
af dx 
po i an [O 
f t 

ay dt 
df 
dt 


Figure 2.60 The chain rule of Proposition 5.2. 


derivative. These terms make sense if we contruct an appropriate “variable hier- 
archy” diagram, as shown in Figure 2.60. At the intermediate level, f depends 
on two variables, x and y (or, equivalently, on the vector variable x = (x, y)), 
so partial derivatives are in order. On the final or composite level, f depends on 
just a single independent variable ¢ and, hence, the use of the ordinary derivative 
df /dt is warranted. Second, the formula in Proposition 5.2 is a generalization of 
equation (2): A product term appears for each of the two intermediate variables. 


EXAMPLE 2 Suppose f(x, y) = (x + y?)/(2x? + 1) is a temperature func- 
tion on R? and x(t) = (2t, t + 1). The function x gives parametric equations 
for a line. (See Figure 2.61.) Then 
2t+ (+1?  ?4+4t+1 
8&2?+1 = 8Be+l 
is the temperature function along the line, and we have 
df — 4—14t — 3217 
dt (8+1 ° 
by the quotient rule. Thus, all the hypotheses of Proposition 5.2 are satisfied and 
so the derivative formula must hold. Indeed, we have 


Of _ 1 — 2x? — 4xy? 


(f o 96) = f(x@)) = 


dx (2x2 + 1%’ 
of ay 
dy 2x24 17 
and 
dx dy 
í = aia ot aoe = Jl 
v= (F.2)=a0 
Therefore, 
əf dx | dafdy _ 1 — 2x? — 4xy? ay j 
axdt ` dydt (2x? +1) " 2x2 +1 


_ 20 — 8P- 84 +1?) | 24 +1) 
(812 + 1)? Y Be 41’ 
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after substitution of 2t for x and t + 1 for y. Hence, 
ðf dx | afdy _ 2(2—7t— 161’) 


ax dt dy dt (822+12 ’ 
which checks with our previous result for df/dt. + 


Proof of Proposition 5.2 Denote the composite function f o x by z. We want to 
establish a formula for dz/dt at tọ. Since z is just a scalar-valued function of one 
variable, differentiability and the existence of the derivative mean the same thing. 
Thus, we consider 

dz . z(t) — 2(to) 

— (to) = lim ————— 

Ji (to) 


t—to t — to 


and see if this g exists. We have 
Ein J= m LOO y(t)) — f (x(t), y(to)) 


t — to 


The first step is to rewrite the numerator of the limit expression by subtracting 
and adding f(x, y) and to apply a modicum of algebra. Thus, 


dz 4 f(x, y)— f(xo, y) + f(xo, y) — fo, yo) 
— (to) = lim 
dt t>to t — to 
=j fœ, y)— fo, y) . f (xo, y) — f (xo, yo) 
= lim + lim : 
>I t —t tI t — to 


(Remember that x(to) = Xo = (xo, yo).) Now, for the main innovation of the proof. 
We apply the mean value theorem to the partial functions of f. This tells us that 
there must be a number c between x9 and x and another number d between yo 
and y such that 


f(x, y) — fo, y) = f(c, yx — xo) 


and 


f (xo, y) — f(xo, Yo) = fy(xo, dy — yo). 
Thus, 


dz ý xX — Ki i 
— (to) = lim fr(c, y) + lim fy (xo, 
dt t>t t — to t>t 


x(t)—x(to) | | 0- y(to) 


= lim f(c, y) ————— +1 , 
ION e ae) ae 
= fex, we (00) + fio. we = (to) 
by the definition of the ee 


dx dy 
PPAG and a) 


and the fact that f,(c, y) and f,(xo, d) must approach fx(xo, yo) and fy(xo, yo), 
respectively, as t approaches to, by continuity of the partials. (Recall that f was 
assumed to be of class C!. ) This completes the proof. a 


Proposition 5.2 and its proof are easy to generalize to the case where f 
is a function of n variables (i.e., f: X C R” > R) and x : T C R — R”. The 


R2 
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appropriate chain rule formula in this case is 


df af of 


af 
op) = 
dt ( 0) Oxy Che) 


| 

© Xn 
Note that the right side of equation (3) can also be written by using matrix notation 
so that 


d d d n 
(xo) = (to) 4 (xo) (to) Boe (xo) = wk 0) 


d 
—*(to) 
aN 
df. _ | of af af Ty © 
a = [Zw Ja Ar ae oo) | j 
dXn 


| Th (to) | 


Thus, we have shown 


oF (to) = Df (xo) Dx(to) = V f(0) +x (to) 


dt 


where we use x’(fp) as a notational alternative to Dx(to). The version of the chain 
rule given in formula (4) is particularly important and will be used a number of 
times in our subsequent work. 


Let us consider further instances of composition of functions of many 
variables. For example, suppose X is open in R°, T is open in R’, and 
f:X CR’ > Randx:T C R? > R? are such that the range of x is contained 
in X. Then the composite f ox: T C R? — R can be formed, as shown in 
Figure 2.62. Note that the range of x, that is, x(T), is just a surface in R?, so 
f o x can be thought of as an appropriate “temperature function” restricted to this 
surface. If we use x = (x, y, z) to denote the vector variable in R? and t = (s, t) 
for the vector variable in R, then we can write a plausible chain rule formula 
from an appropriate variable hierarchy diagram. (See Figure 2.63.) Thus, it is 


Figure 2.62 The composite f o x where f: X C R? > R and x: T C R? > RÈ. 
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Dependent Intermediate Final 
variable variables variables 

of x 
Ox 
ôf 
0 

f y 
of 
Oz Z 


Figure 2.63 The chain rule for f o x, where f: X C R? > R and 
x: T C R? > R’. 


reasonable to expect that the following formulas hold: 


af af dx ƏfƏðy , af az 


ds oxds  dyds dzds 


and (5) 


af afax . af ay | af az 


ət ax Ot = Ay Of_~—sAz Ot 
(Again, we abuse notation by writing both df/ds, df/dt and df/dax, df/day, 
df/dz.) Indeed, when f is a function of x, y, and z of class C!, formula (3) 
with n = 3 applies once we realize that 0x/ds, dx/dt, etc., represent ordinary 
differentiation of the partial functions in s or t. 


EXAMPLE 3 Suppose 


f(x,y,z) =x? +y? +z? and x(s,t) =(scost, e", s? — t’). 


Then A(s, t) = f ox(s, t) = s? cos? t + e** + (s? — #7)’, so that 


əh ə 

— = Fes) = 2s cos? t + 2te*™ + 4s(s* — t°) 

ðs ðs 

əh ə i 

= re 3 = —2s cost sint + 2se* — 4t(s? — t°). 


We also have 


af af 
2o ee 


=2x, == 2y, = 22 
Ox dy Oz 
and 
0 Ox : 
—-=cost, —=-—-ssinft, 
ð ot 
a 
W c Uc 
Os ot 
Oz az 
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Hence, we compute 


of (fox) dafdx | dfdy | df dz 
ðs ðs ~ ðxðs | dyads | dzas 


= 2x(cost) + 2y(te”) + 2z(2s) 
= 2s cos t(cos t) + 2e” (te) + 2(s* — t?)(2s) 


= 2s cos? t + 2te” + 45(s* — t°), 


just as we saw earlier. We leave it to you to use the chain rule to calculate df/dt 
in a similar manner. + 


Of course, there is no need for us to stop here. Suppose we have an open set 
X in R”, an open set T in R”, and functions f:X — R and x: T —> R” such 
that h = f o x: T — R can be defined. If f is of class C! and x is differentiable, 
then, from the previous remarks, h must also be differentiable and, moreover, 


oh _ af Ox, l Of 3x2 i Of OXm 
Ot; Ox, Ot; | Ox) Ot; ` Pxm Ot; 
maf a 
Sh 
kal OXK Ot; 


Since the component functions of a vector-valued function are just scalar-valued 
functions, we can say even more. Suppose f: X C R” — R? and x: T C R” > 
R” are such that h = fox: T C R” — R? can be defined. (As always, we assume 
that X is open in R” and T is open in R”.) See Figure 2.64 for a representation of 
the situation. If fis of class C! and x is differentiable, then the composite h = fox 
is differentiable and the following general formula holds: 


ah; ws af; Ox, 
tj 7 kel OX, dt; , 


i=1,2,...,p; f=1,2,...,n. (6) 


The plausibility of formula (6) is immediate, given the variable hierarchy diagram 
shown in Figure 2.65. 


T X 
x f 
R” R? 


R” 


Figure 2.64 The composite f o x where f: X C R” —> R? and x: T C R” > R”. 


Now comes the real “magic.” Recall that if A is a p x m matrix and B is an 
m x n matrix, then the product matrix C = AB is defined and is a p x n matrix. 
Moreover, the ijth entry of C is given by 


m 


cij = > aikbkj. 
k=l 
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Dependent Intermediate Final 
variables variables variables 


af, OX 
Ox, Ot 
EEE 
f af; : OX h 
OX Ot, 
h b 
X2 
af, ` OXm 
OX i Oty 
fa Xm bn 


Figure 2.65 The chain rule diagram for f o x, where f: X C R” — R? and 
x: T C R” > R”. 


If we recall that the ijth entry of the matrix Dh(t) is ðh;/ðt;, and similarly for 
Df(x) and Dx(t), then we see that formula (6) expresses nothing more than the 
following equation of matrices: 


Dh(t) = D(fo X(t) = Df(x) Dx(t). (7) 


The similarity between formulas (7) and (1) is striking. One of the reasons 
(perhaps the principal reason) for defining matrix multiplication as we have is 
precisely so that the chain rule in several variables can have the elegant appearance 
that it has in formula (7). 


EXAMPLE 4 Suppose f:R?’—R?’ is given by f(x}, x2,x3)= 
(x1 — X2, X1X2x3) and x: R? —> R? is given by x(t, b) = (tib, t; B): Then 
fo x: R? > R? is given by (f o x)(t1, h) = (tih — t?, #723), so that 


D(fo x)(t) = | emi A | 


CT 3RR 
On the other hand, 
t t 
1 =l 0 ~ 
Df(x) = and Dx(t)= | 2h 0 |, 
X2X3 X1{X3 X1X2 
0 2to 


so that the product matrix is 


Df(x) Dx(t) = | o= "i | 


X2X3bħ + 2x1X3f1  xX2X3tı + 2x1X2ħ2 


= h — 2t ti 
| PB +2 Re+ |’ 
after substituting for x1, x2, and x3. Thus, D(fo x)(t) = Df(x)Dx(t), as expected. 
Alternatively, we may use the variable hierarchy diagram shown in Figure 2.66 
and compute any individual partial derivative we may desire. For example, 
afa Ofr Ox] l afa ƏxX2 l afz 0x3 
atı Ox, Of; OX) Ot; i 0x3 Oty 
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Dependent Intermediate Final 
variables variables variables 


Figure 2.66 The variable hierarchy diagram for Example 4. 


by formula (6). Then by abuse of notation, 


0 
ŽP = aax) + Dn) + O) 
1 
= (PEN) + (AN) 
San, 
which is indeed the (2, 1) entry of the matrix product. + 


At last we state the most general version of the chain rule from a technical 
standpoint; a proof may be found in the addendum to this section. 


THEOREM 5.3 (THE CHAIN RULE) Suppose X C R” and T C R” are open and 
f: X — R? and x: T — R” are defined so that range x C X. If x is differen- 
tiable at tọ € T and f is differentiable at xp = x(tọ), then the composite f o x is 
differentiable at tọ, and we have 


D(f o x)(to) = Df(xo)Dx(to). 


The advantage of Theorem 5.3 over the earlier versions of the chain rule we 
have been discussing is that it requires f only to be differentiable at the point in 
question, not to be of class C!. Note that, of course, Theorem 5.3 includes all 
the special cases of the chain rule we have previously discussed. In particular, 
Theorem 5.3 includes the important case of formula (4). 


EXAMPLE 5 Let f: R? — R? be defined by f(x, y) = (x — 2y + 7, 3xy’). 
Suppose that g: R? —> R? is differentiable at (0,0,0) and we know that 
g(0, 0, 0) = (—2, 1) and 


24 5 
Dg(0, 0, 0) = | 101 |. 
We use this information to determine D(f o g)(0, 0, 0). 


First, note that Theorem 5.3 tells us that f o g must be differentiable at (0, 0, 0) 
and, second, that 


D(£ o g)(0, 0, 0) = Df(g(0, 0, 0)) Dg(0, 0, 0) = Df(—2, 1)Dg(0, 0, 0). 
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Since we know f completely, it is easy to compute that 


1 -2 1 -2 
Df(x, y) = | Sy) bey | so that Df(—2, 1) = | 3 12 i 


Thus, 


1 -2 245 4 4 3 
awawi a ea 


We remark that we needed the full strength of Theorem 5.3, as we do not know 
anything about the differentiability of g other than at the point (0, 0, 0). + 


EXAMPLE 6 (Polar/rectangular conversions) Recall that in §1.7 we pro- 
vided the basic equations relating polar and rectangular coordinates: 

x =r cos 

y=rsin0 ` 


Now suppose you have an equation defining a quantity w as a function of x and 
y; that is, 


w= f(x, y). 


Then, of course, w may just as well be regarded as a function of r and 0 by 
susbtituting r cos 0 for x and r sin 8 for y. That is, 


w = g(r, 0) = f(r, 9), y(r, 0)). 


Our question is as follows: Assuming all functions involved are differentiable, 
how are the partial derivatives dw/dr, Əw/30 related to ðw/3x, dw/dy? 

In the situation just described, we have w = g(r, 0) = (f o x)(r, 9), so that 
the chain rule implies 


Dg(r, 0) = Df (x, y)Dx(r, 0). 


Therefore, 
Ox Ox 
je jf uje 7 
or 00 


_[ of əf cos@ —r sind 

— | Ox dy sinô  rcos@ |’ 
By extracting entries, we see that the various partial derivatives of w are related 
by the following formulas: 


dw dw ; dw 
— = cos — + sind — 
or Ox oy 
dw dw (8) 
= —r sin Hr cos — 
00 Ox dy 


The significance of (8) is that it provides us with a relation of differential 
operators: 
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The appropriate interpretation for (9) is the following: Differentiation with 
respect to the polar coordinate r is the same as a certain combination of differen- 
tiation with respect to both Cartesian coordinates x and y (namely, the combina- 
tion cos@ 0/dx + sin@ d/dy). A similar comment applies to differentiation with 
respect to the polar coordinate 6. Note that, whenr 4 0, we can solve algebraically 
for 0/dx and 0/dy in (9), obtaining 


We will have occasion to use the relations in (9) and (10), and the method of 
their derivation, later in this text. + 


Addendum: Proof of Theorem 5.3 


We begin by noting that the derivative matrices Df(xọ) and Dx(tọ) both exist 
because fis assumed to be differentiable at xọ and x is assumed to be differentiable 
at ty. Thus, the product matrix Df(xo) Dx(to) exists. We need to show that the limit 
in Definition 3.8 is satisfied by this product matrix, that is, that 

TA — [£o x)(to) + Df(xo)Dx(to)(t — toll _ 


li 
t>to \|t — toll 


0. (11) 


In view of the uniqueness of the derivative matrix, it then automatically follows 
that f o x is differentiable at tọ and that Df(xo)Dx(to) = D(f o x)(to). Thus, we 
entirely concern ourselves with establishing the limit (11) above. 
Consider the numerator of (11). First, we rewrite 
(fo x)(t) — [(fo x)(to) + Df(xo)Dx(to)(t — to)] 
= (fo x)(t) — (fo x)(to) — Df(xo)(x() — x(to)) 
+ Df(xX0)(x(t) — x(to)) — Df(xo) Dx(to)(t — to). 


Then we use the triangle inequality: 


IŒ o x(t) — [(fo x)(to) + Df(x0)Dx(to)(t — to)] II 
< |Œ ° x)(t) — (fo x)(to) — Do )(x() — x(t) 
+ || DEZ) — x(to)) — DFX) Dx(to)(t — to) 
= (fo x)(t) — (fo x)(to) — D£(x0)(x() — x(t) 
+ || Df(Xo) [&(®) — x(to)) — Dx(to)(t — to)] Il. 
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By inequality (9) in the proof of Theorem 3.9, there is a constant K such that, for 
any vector h € R”, || Df(xo)h|| < K ||h||. Thus, 
(£o x(t) — (fo x)(to) — Df(xo) Dx(to)(t — to) 
< ||(fo x)(t) — (fo x)(to) — DEZ) — xl 02) 
+ K||x(t) — x(to) — Dx(to)(t — to)l|. 


To establish the limit (11) formally, we must show that given any € > 0, we 
may find a ô > 0 such that if 0 < ||t — tol] < ô, then 


IŒ ° x)(t) — [(£ 0 x)(to) + D£(x0) Dx(to)(t — to)] 
IIt — toll 


Consider the first term of the right side of (12). Using the differentiability of x at 
tọ and inequality (11) in the proof of Theorem 3.9, we can find some ôọ > 0 and 
a constant Ko such that if 0 < ||t — to|| < do, then 


lx) — x(to)l| < Kollt — toll. 


Il 
<€ 


By the differentiability of f at xo, given any €; > 0, we may find some 6; > 0 
such that if 0 < ||x — xo|| < 6), then 


IIx) — [fx0) + D£(xo)(x — x0)] |I 
IIx — xoll 


Set €; = €/(2Ko). With x = x(t), x9 = x(tọ), we have that if both 0 < ||t — to|| < 
ôo and 0 < ||t — to|| < 6;/Ko, then 


I|x(t) — x(to)|| < Kollt — toll < 41. 


<E 


Hence, 


I(E o x)(t) — (fo x)(to) — Df(xo)(x(t) — x(to))l| < e llx(t) — x(to)ll 
A (13) 


Now look at the second term of the right side of (12). Since x 1s differentiable 
at tp, given any €) > 0, we may find some 42 > 0 such that if 0 < ||t — tol] < 6), 
then 


< €1Ko||t — tol] = 


Ix(t) — [x(to) + Dx(to)(t — to)] Il 
<€ 
It — toll 
Set e2 = €/(2K). Then, for 0 < ||t — to|| < 62, we have 


IIx(t) — [x(t) + Dx(to)(t — to)] || < - It — toll. (14) 


Finally, let ô be the smallest of ôo, 8&1 / Ko, and 52. Then, for 0 < ||t — to|| < 6, we 
have that both the inequalities (13) and (14) hold and thus (12) becomes 


II(fo x)(t) — (fo x)(to) — D£(Xo) Dx(to)(t — to)| 


E € 
t-t -K ( t-t ) 
< gllt—toll + K (sell — toll 


= e||t — toll. 


Hence, 


(£ o x(t) — (fo x)(to) — Df(xo) Dx(to (t — to)|| mp 
Iit — toll 


as desired. E 


` 


2.5 Exercises 


. If fx, y, z) =x? —y + xyz, and x = 6t +7, y= 
sin 2r, z = t°, verify the chain rule by finding df/dt 
in two different ways. 


. If f(x, y) = sin (xy) and x = s + t, y = s? + t°, find 
df/ds and Əf/ðt in two ways: 
(a) by substitution. 


(b) by means of the chain rule. 


. Suppose that a bird flies along the helical curve 
x = 2cost, y = 2sint,z = 3t. The bird suddenly en- 
counters a weather front so that the barometric pres- 
sure is varying rather wildly from point to point as 
P(x, y, z) = 6x?z/y atm. 

(a) Use the chain rule to determine how the pressure 

is changing at t = x /4 min. 
(b) Check your result in part (a) by direct substitution. 


(c) What is the approximate pressure at t = m /4 + 
0.01 min? 


. Suppose that z = x? + y?, where x = st and y is a 
function of s and t. Suppose further that when (s, t) = 


3 
(2, 1), dy/dt = 0. Determine o, 1). 


. You are the proud new owner of an Acme Deluxe Bread 
Kneading Machine, which you are using for the first 
time today. Suppose that at noon the dimensions of your 
(nearly rectangular) loaf of bread dough are L = 7 in 
(length), W = 5 in (width), and H = 4 in (height). At 
that time, you place the loaf in the machine for knead- 
ing and the machine begins by stretching the loaf’s 
length at an initial rate of 0.75 in/min, punching down 
the loaf’s height at a rate of 1 in/min, and increasing 
the loaf’s width at a rate of 0.5 in/min. What is the rate 
of change of the volume of the loaf when the machine 
starts? Is the dough increasing or decreasing in size at 
that moment? 


. Arectangular stick of butter is placed in the microwave 
oven to melt. When the butter’s length is 6 in and its 
square cross section measures 1.5 in ona side, its length 
is decreasing at a rate of 0.25 in/min and its cross- 
sectional edge is decreasing at a rate of 0.125 in/min. 
How fast is the butter melting (i.e., at what rate is the 
solid volume of butter turning to liquid) at that instant? 


. Suppose that the following function is used to model 
the monthly demand for bicycles: 


P(x, y) = 200 + 20V0.1x + 10 — 12,/y. 


In this formula, x represents the price (in dollars 
per gallon) of automobile gasoline and y repre- 
sents the selling price (in dollars) of each bicycle. 
Furthermore, suppose that the price of gasoline t 


10. 
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months from now will be 
mt 
x =1+0.1t — cos — 
6 
and the price of each bicycle will be 
_ at 
y = 200 + 2t sin 6" 


At what rate will the monthly demand for bicycles be 
changing six months from now? 


. The Centers for Disease Control and Prevention pro- 


vides information on the body mass index (BMI) 
to give a more meaningful assessment of a person’s 
weight. The BMI is given by the formula 


10,000w 
BMI = —.—_, 
h? 

where w is an individual’s mass in kilograms and h 

the person’s height in centimeters. While monitoring a 

child’s growth, you estimate that at the time he turned 

10 years old, his height showed a growth rate of 0.6 cm 

per month. At the same time, his mass showed a growth 

rate of 0.4 kg per month. Suppose that he was 140 cm 
tall and weighed 33 kg on his tenth birthday. 

(a) At what rate is his BMI changing on his tenth 
birthday? 

(b) The BMI of a typical 10-year-old male increases 
at an average rate of 0.04 BMI points per month. 
Should you be concerned about the child’s weight 
gain? 


. A cement mixer is pouring concrete in a conical pile. 


At the time when the height and base radius of the 
concrete cone are, respectively, 30 cm and 12 cm, the 
rate at which the height is increasing is 1 cm/min and 
the rate at which the volume of cement in the pile is 
increasing is 320 cm?/min. At that moment, how fast 
is the radius of the cone changing? 


A clarinetist is playing the glissando at the beginning of 
Rhapsody in Blue, while Hermione (who arrived late) 
is walking toward her seat. If the (changing) frequency 
of the note is f and Hermione is moving toward the 
clarinetist at speed v, then she actually hears the fre- 
quency ¢ given by 


c+uvu 
Cc 


o= Í. 


where c is the (constant) speed of sound in air, about 
330 m/sec. At this particular moment, the frequency is 
f = 440 Hz and is increasing at a rate of 100 Hz per 
second. At that same moment, Hermione is moving 
toward the clarinetist at 4 m/sec and decelerating at 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 
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2 m/sec”. What is the perceived frequency ¢ she 
hears at that moment? How fast is it changing? Does 
Hermione hear the clarinet’s note becoming higher or 
lower? 


Suppose z = f(x, y) has continuous partial deriva- 
tives. Let x =e’ cos0, y= e"sinð. Show that 


then 
Oz P az\? L2 | (92 a az\? 
— —] =e — — : 
Ox dy or 00 
Suppose that z= f(x,y) has continuous partial 


derivatives. Let x = 2uv and y = u? + v?. Show that 
then 


dz Oz 2 dz p az\7 44 dz OZ 
— — = ZX — m e T. 
ðu dv ax ay ? Jx ay 

If w=g (u? =y, v= u°) has continuous partial 


derivatives with respect to x = u? — v? and y = v? — 
u, show that 


dw m dw 0 
v — +u — = 0. 
ou dv 


Suppose that z = f(x + y, x — y) has continuous par- 
tial derivatives with respect to u = x + y and v= 
x — y. Show that 


oz Oz az\* az\° 
ax dy \au dv) ` 


Ifw=f (==>) is a differentiable function of 
x2 + y? 


= P show that 
ðw ðw 
x—+y—=0. 
ox dy 
2—y? 
If w = f | ~—- | is a differentiable function of 
x2 +y? 
Day 
u = ———~, show that then 
x2 + y2 
ðw ðw 
x—+y—=0. 
ox dy 
YTX E 


Suppose w = f ( — =) is a differentiable 
z 


xy 


function ofu = x 


—x z—x 
and v = ——. Show then that 
xy XZ 


18. 


Suppose that w = g E ©) is a differentiable func- 
y 


y 3 
tion of u = x/y and v = z/y. Show then that 


In Exercises 19-27, calculate D(f o g) in two ways: (a) by first 
evaluating f o g and (b) by using the chain rule and the deriva- 
tive matrices Df and Dg. 


19. 
20. 
21. 
22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


f(x) = (3x°, e™), g(s,t)=s—Tt 
f(x) = (x, cos 3x, In x), g(s, t, u) = s +°? +w? 
f(x, y) = ye*, g(s,t)=(s—t,s +t) 


f(x, y) = x? — 3y?, g(s, t) = (st, s + t°) 
f(x,y) = (= -254 '), ge) = (Žr) 
xy t 


f(x, y, z) = y + y*z, xyz, e), 
g(t) = (t —2,3t +7, t) 
f(x, y) = (xy?, xy, x? + y’), g(t) = (sinż, e") 


f(x, y) = (x? — y, y/x, &), 
3u, stu) 


g(s, t, u) = (s + 2t + 


f(x, y, z) = (x + y +z, xX? — e%), 
g(s, t, u) = (st, tu, su) 


Let g:R?— R? be a differentiable function 
such that g(1,—1,3)= (2,5) and Dg(1,—1,3) = 


1 -1 0 
4 07 


fined by f(x, y) = (2xy,3x — y + 5). 
D(fog)(1, —1, 3)? 


| Suppose that f:R? — R? is de- 


What is 


Let g: R? — R? and f:R?— R? be differentiable 
functions such that g(0,0)= (1,2) g(1,2)= 
(3,5), f(0,0)= (3,5), f(4,1)= (1,2), Dg(0,0)= 


lea ral patl.2)=| 5 J DIG, 5)= 


È s poren] = a 
(a) Calculate D(f o g)(1, 2). 


(b) Calculate D(g o f)(4, 1). 


Let z= f(x,y), where f has continuous partial 
derivatives. If we make the standard polar/rectangular 
substitution x = r cos 0, y = r sin 0, show that 


0z T 0z a 0z spd azy? 
ax dy) — \ar r2 (30) ` 


31. 


32. 


33. 


34. 


(a) Use the methods of Example 6 and formula (10) 
in this section to determine 07/4x? and 3?/3y? 
in terms of the polar partial differential operators 
a? /dr?, 8/007, Ə? /Ər 6, 9/dr, and 9/40. (Hint: 
You will need to use the product rule.) 

(b) Use part (a) to show that the Laplacian operator 
8*/dx? + 87/dy? is given in polar coordinates by 
the formula 


ə? ə? ə 1ə 1 8? 
əx? ay2 ðr? rðr  r?əð 


Show that the Laplacian operator 97 /3x? + 8? /3y? + 
3? /əz? in three dimensions is given in cylindrical co- 
ordinates by the formula 
2 Ff Ë Ë 13 1# F 
ax2 Əy? Əz? ðr? rər r?əð? əz” 


In this problem, you will determine the formula for the 

Laplacian operator in spherical coordinates. 

(a) First, note that the cylindrical/spherical conver- 
sions given by formula (6) of §1.7 express the 
cylindrical coordinates z and r in terms of the 
spherical coordinates p and ¢ by equations of pre- 
cisely the same form as those that express x and 
y in terms of the polar coordinates r and 0. Use 
this fact to write 0/dr in terms of 0/dp and 3/3. 
(Also see formula (10) of this section.) 


(b) Use the ideas and result of part (a) to establish the 
following formula: 
eo . oF 
əx? dy? az? 
a? n 1 3? 4 1 æ 
~ ðP? p? dg? p? sin? g 30? 


20 coto ð 

pop p? dag. 
Suppose that y is defined implicitly as a function y(x) 
by an equation of the form 


F(x, y)=0. 
(For example, the equation x? — y? = 0 defines y as 
two functions of x, namely, y = x3/? and y = —x3/?. 


The equation sin(xy) — x?y’ + e” = 0, on the other 

hand, cannot readily be solved for y in terms of x. See 

the end of §2.6 for more about implicit functions.) 

(a) Show that if F and y(x) are both assumed to be 
differentiable functions, then 


dy __ Fex.y) 
dx Fy(x, y) 


provided F, (x, y) Æ 0. 
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(b) Use the result of part (a) to find dy/dx when y 
is defined implicitly in terms of x by the equa- 
tion x? — y? = 0. Check your result by explicitly 
solving for y and differentiating. 


35. Find dy/dx when y is defined implicitly by the equa- 
tion sin(xy) — xy’ + e” = 0. (See Exercise 34.) 


36. Suppose that you are given an equation of the form 
F(x, y,z) =9, 


for example, something like xz+ycosz+ 

(sin y)/z = 0. Then we may consider z to be defined 

implicitly as a function z(x, y). 

(a) Use the chain rule to show that if F and z(x, y) are 
both assumed to be differentiable, then 


az F(x, y, Z) dz F(x, y, 2) 
ax F,(x,y,z)’ ay FAR, z) 


(b) Use part (a) to find 0z/dx and dz/dy where z is 
given by the equation xyz = 2. Check your result 
by explicitly solving for z and then calculating the 
partial derivatives. 


37. Find dz/dx and 0z/dy, where z is given implicitly by 
the equation 


sin 
x°z+ycosz+ tai = 0. 


(See Exercise 36.) 


38. Let 


xy 


fæ y= Pt 
0 if (x, y) = (0, 0) 


if (x, y) # (0, 0) 


(a) Use the definition of the partial derivative to find 
fC, 0) and fy, 0). 

(b) Let a be a nonzero constant and let x(t) = 
(t, at). Show that f o x is differentiable, and find 
D(f o x)(0) directly. 

(c) Calculate Df (0, 0)Dx(0). How can you reconcile 
your answer with your answer in part (b) and the 
chain rule? 


Let w = f(x, y, z) be a differentiable function of x, y, and 
z. For example, suppose that w = x + 2y + z. Regarding the 
variables x, y, and z as independent, we have dw/dx = 1 and 
dw/dy = 2. But now suppose that z = xy. Then x, y, and z 
are not all independent and, by substitution, we have that w = 
x+2y+ xy so that dw/dx =1+yanddw/dy = 2 + x. To 
overcome the apparent ambiguity in the notation for partial 
derivatives, it is customary to indicate the complete set of in- 
dependent variables by writing additional subscripts beside 
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the partial derivative. Thus, 


(5), 


would signify the partial derivative of w with respect to x, 
while holding both y and z constant. Hence, x, y, and z are the 
complete set of independent variables in this case. On the other 
hand, we would use (0w/0x), to indicate that x and y alone are 
the independent variables. In the case that w = x + 2y + z, 
this notation gives 


(=) =1, (=) =2, and (=) = 1, 
Ox yz ƏY J xz Oz j 


Ifz = xy, then we also have 


dw dw 
—j}) =l1+y, and —]) =24x. 
Ox y dy x 


In this way, the ambiguity of notation can be avoided. Use this 
notation in Exercises 39-45. 


39. Letw=x+7y— 10z and z = x? + y?. 


. dw dw dw dw 
(a) Find (=) ; (=) 3 (=) > (=) > 
Ox ar dy J xz az ey Ox 7 


(b) Relate (9w/əx)y; and (dw/dx), by using the 
chain rule. 


40. Repeat Exercise 39 where w = x? + y? + 23 and z = 
2x — 3y. 


41. 


42. 


43. 


44. 


45. 


Suppose s = x?y+xzw—z* and xyw — y°z + xz 
= 0. Find 


a a 
(=) and (=) 3 
0z PT ZJ 5 


Let U = F(P, V, T) denote the internal energy of a 
gas. Suppose the gas obeys the ideal gas law PV = kT, 
where k is a constant. 


(a) Find (Gr). 
(b) Find (5), 


(c) Find (2) ; 
aP/y 


Show that ifx, y, z are related implicitly by an equation 
of the form F(x, y, z) = 0, then 


CEL E 


This relation is used in thermodynamics. (Hint: Use 
Exercise 36.) 


The ideal gas law PV = kT, where k is a constant, 
relates the pressure P, temperature T, and volume V 
of a gas. Verify the result of Exercise 43 for the ideal 
gas law equation. 


Verify the result of Exercise 43 for the ellipsoid 
ax? + by? +c? =d 


where a, b, c, and d are constants. 


2.6 Directional Derivatives and the Gradient 


In this section, we will consider some of the key geometric properties of the 


gradient vector 


ope (2, 2L) 


Ox, OX.” OXy 


of a scalar-valued function of n variables. In what follows, n will usually be 2 


or 3. 


The Directional Derivative 


Let f(x, y) bea scalar-valued function of two variables. In §2.3, we understood the 
partial derivative F(a, b) as the slope, at the point (a, b, f(a, b)), of the curve 
obtained as the intersection of the surface z = f(x, y) with the plane y = b. 
The other partial derivative Fa, b) has a similar geometric interpretation. How- 
ever, the surface z = f(x, y) contains infinitely many curves passing through 
(a,b, f(a, b)) whose slope we might choose to measure. The directional deriva- 
tive enables us to do this. 


z=f(x,y) 
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An alternative way to view (a, b) is as the rate of change of f as we move 
“infinitesimally” from a = (a, b) in the i-direction, as suggested by Figure 2.67. 
This is easy to see since, by the definition of the partial derivative, 


of fla +h, b)— Fa, d) 
eo h 
= f((a, b) + (h, 0)) — f(a, b) 
= lim 
h—0 h 
= 4 f(a, b)+ h(l, 0)) — f(a, b) 
= lim 
h—0 h 
_ in LAF A= SOA 
= lim : 
h—0 h 


Note that we are identifying the point (a, b) with the vector a = (a, b) = ai + bj. 
Similarly, we have 

0 hj) — f(a 

f a,b) = lim PCD -1A 
dy h—>0 h 

Writing partial derivatives as we just have enables us to see that they are 

special cases of a more general type of derivative. Suppose v is any unit vector in 
R’. (The reason for taking a unit vector will be made clear later.) The quantity 
. f(athv) — f(a) 
m 


li 
h>0 h 


(1) 


is nothing more than the rate of change of f as we move (infinitesimally) from a = 
(a, b) in the direction specified by v = (A, B) = Ai + Bj. It’s also the slope of the 
curve obtained as the intersection of the surface z = f(x, y) with the vertical plane 
B(x — a) — A(y — b) = 0. (See Figure 2.68.) We can use the limit expression in 
(1) to define the derivative of any scalar-valued function in a particular direction. 


B(x-a)-A(y- 6b) =0 


Figure 2.67 Another way to view the Figure 2.68 The directional derivative. 
partial derivative 0f/dx at a point. 
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DEFINITION 6.1 Let X be open in R”, f: X C R” — R a scalar-valued 
function, anda € X.Ifv € R” is any unit vector, then the directional deriva- 
tive of f at a in the direction of v, denoted D, f (a), is 


f(a + hv) — f(a) 
h 


Dy f(a) = lim 


(provided that this limit exists). 


EXAMPLE 1 Suppose f(x, y) = x? — 3xy + 2x — 5y. Then, if v = (v, w) € 
R? is any unit vector, it follows that 


F(O, 0) + AW, w)) — FO, 0) 


D, (0.0) = jim 


h 
_ bev? — 3h? uw + 2hv — Shw 
= lim 
h—0 h 


= lim (hv? — 3hvw + 2v — 5w) 


= 2v — 5w. 
Thus, the rate of change of f is 2v — 5w if we move from the origin in the 
direction given by v. The rate of change is zero if v = (5/729, 2/429) or 
(—5/V 29, —2/4 29). + 


Consequently, we see that the partial derivatives of a function are just the “tip 
of the iceberg.” However, it turns out that when f is differentiable, the partial 
derivatives actually determine the directional derivatives for all directions v. To 
see this rather remarkable result, we begin by defining a new function F of a 
single variable by 


F(t) = f(a+tv). 
Then, by Definition 6.1, we have 


im f@tM-f@ pp FO-FO _ y 
a ne A fg. e 
That is, 
d 
DfA = = faH), 2) 


The significance of equation (2) is that, when f is differentiable at a, we can apply 
the chain rule to the right-hand side. Indeed, let x(t) = a + tv. Then, by the chain 
rule, 


“Fla + tv) = Df (x)Dx(t) = Df (xv. 


Evaluation at t = 0 gives 


Dy f(a) = Df (a)v = V f(a)» v. (3) 


The purpose of equation (3) is to emphasize the geometry of the situation. The 
result above says that the directional derivative is just the dot product of the 
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gradient and the direction vector v. Since the gradient is made up of the partial 
derivatives, we see that the more general notion of the directional derivative 
depends entirely on just the direction vector and the partial derivatives. To be 
more formal, we summarize this discussion with a theorem. 


THEOREM 6.2 Let X C R” be open and suppose f: X — R is differentiable 
at a € X. Then the directional derivative Dy f(a) exists for all directions (unit 
vectors) v € R” and, moreover, we have 


Dy f(a) = V f(a). v. 


EXAMPLE 2 The function f(x, y) = x? — 3xy + 2x — 5y we considered in 
Example | has continuous partials and hence, by Theorem 3.5, is differentiable. 
Thus, Theorem 6.2 applies to tell us that, for any unit vector v = vi + wj € R’, 


Dy f (0, 0) = V f (0, 0) -v = (f.(0, 0)i + ALC, 0)j) - (vi + wj) 
= (2i — 5j)-(vi+ wj) 


= 2v — 5w, 


as seen earlier. + 


EXAMPLE 3 The converse of Theorem 6.2 does not hold. That is, a function 
may have directional derivatives in all directions at a point yet fail to be differ- 
entiable. To see how this can happen, consider the function f: R? — R defined 
by 


xy? 


fœ, y= ea if (x, y) # (0, 0) 
0 if(x, y) = (0,0) 


This function is not continuous at the origin. (Why?) So, by Theorem 3.6, it 
fails to be differentiable there; however, we claim that all directional derivatives 
exist at the origin. To see this, let the direction vector v be vi + wj. Hence, by 
Definition 6.1, we observe that 


f(O, 0) + h(vi+ wj)) — FO, 0) 
h 


1 hv(hwy r 
“aboh io + (hwy! | 


Dy f (0, 0) = lim 


h? vw? 


= im ——_-——_ 
h>0 h?(v? + h?w*) 
i vw? vw? w? 
= lim = . 
h>0 v2 + h2w4 v2 v 
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Thus, the directional derivative exists whenever v 4 0. When v = 0 (in which 
case v = j), we, again, must calculate 


F(O, 0) + Aj) — FO, 0) 


D; f0, 0) = lim 


h 
tin FOW- SOO 
= lim 

h—>0 h 
ee. 4 
Z ae ee 


Consequently, this directional derivative (which is, in fact, df/dy) exists as well. 
+ 


The reason we have restricted the direction vector v to be of unit length in our 
discussion of directional derivatives has to do with the meaning of D, f(a), not 
with any technicalities pertaining to Definition 6.1 or Theorem 6.2. Indeed, we 
can certainly define the limit in Definition 6.1 for any vector v, not just one of unit 
length. So, suppose w is an arbitrary nonzero vector in R” and f is differentiable. 
Then the proof of Theorem 6.2 goes through without change to give 
. f(athw) — f(a) 
m 


li 
h—0 h 


= Vf(a)-w. 


The problem is as follows: If w = kv for some (nonzero) scalar k, then 


om fat hw) — F@) 


m h 7 An 
= V f(a): (ky) 
= k(V f(a) - v) 
( . f(a+ hy) = £a) 
= k | lim i 
h>0 h 


That is, the “generalized directional derivative” in the direction of kv is k times 
the derivative in the direction of v. But v and kv are parallel vectors, and it is 
undesirable to have this sort of ambiguity of terminology. So we avoid the trouble 
by insisting upon using unit vectors only (i.e., by allowing k to be +1 only) when 
working with directional derivatives. 


Gradients and Steepest Ascent 


Suppose you are traveling in space near the planet Nilrebo and that one of your 
spaceship’s instruments measures the external atmospheric pressure on your ship 
as a function f(x, y, z) of position. Assume, quite reasonably, that this function 
is differentiable. Then Theorem 6.2 applies and tells us that if you travel from 
point a = (a, b, c) in the direction of the (unit) vector u = ui + vj + wk, the rate 
of change of pressure is given by 


Du f(a) = V f(a)-u. 


Now, we ask the following: In what direction is the pressure increasing the most? 
If 0 is the angle between u and the gradient vector V f(a), then we have, by 
Theorem 3.3 of §1.3, that 


Du f(a) = |V fal] llul] cos 8 = |V f(a)l| cos 8, 
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since u is a unit vector. Because —1 < cos@ < 1, we have 


IV fall S Duf(@) < IV fall. 


Moreover, cos@ = 1 when 6 = 0 and cos = —1 when 0 = z. Thus, we have 
established the following: 


THEOREM 6.3 The directional derivative D, f(a) is maximized, with respect to 
direction, when u points in the same direction as V f(a) and is minimized when u 
points in the opposite direction. Furthermore, the maximum and minimum values 
of Da f (a) are || V f(a)|| and — || V f(a)||, respectively. 


EXAMPLE 4 Ifthe pressure function on Nilrebo is 
fQYD= 5x2 + Ty4 + x22 atm, 


where the origin is located at the center of Nilrebo and distance units are measured 
in thousands of kilometers, then the rate of change of pressure at (1, —1, 2) 
in the direction of i+ j +k may be calculated as V f(1, —1, 2)+u, where u = 
(i+ j + k)/ v3. (Note that we normalized the vector i+ j + k to obtain a unit 
vector.) Using Theorem 6.2, we compute 


Daf, —1,2) = V f(1, —1,2)-u 


ret 
(Rio 4a. 
J3 
18-2844 
z = —2,/3 atm/Mm. 
V3 


Additionally, in view of Theorem 6.3, the pressure will increase most rapidly 
in the direction of V f(1, —1, 2), that is, in the 


18i—28j+4k  9i— 14j +2k 
18i — 28j + 4k|| ~v 281 


direction. Moreover, the rate of this increase is 


IV fC, —1, 2)|| = 2V281 atm/Mm. e 


Theorem 6.3 is stated in a manner that is independent of dimension—that is, so 
that it applies to functions f: X C R” — Rforanyn > 2. Inthe casen = 2, there 
is another geometric interpretation of Theorem 6.3: Suppose you are mountain 
climbing on the surface z = f(x, y). Think of the value of f as the height of the 
mountain above (or below) sea level. If you are equipped with a map and compass 
(which supply information in the xy-plane only), then if you are at the point on 
the mountain with xy-coordinates (map coordinates) (a, b), Theorem 6.3 says 
that you should move in the direction parallel to the gradient V f(a, b) in order to 
climb the mountain most rapidly. (See Figure 2.69.) Similarly, you should move 
in the direction parallel to —V f(a, b) in order to descend most rapidly. Moreover, 
the slope of your ascent or descent in these cases is || V f(a, b)||. Be sure that you 
understand that V f(a, b) is a vector in R? that gives the optimal north-south, 
east—west direction of travel. 
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Figure 2.69 Select V f(a, b)/||V f(a, b)|| for direction of steepest ascent. 


Tangent Planes Revisited - 


In §2.1, we indicated that not all surfaces can be described by equations of the 
form z = f(x, y). Indeed, a surface as simple and familiar as the sphere 1s not the 
graph of any single function of two variables. Yet the sphere is certainly smooth 
enough for us to see intuitively that it must have a tangent plane at every point. 
(See Figure 2.70.) 

How can we find the equation of the tangent plane? In the case of the unit 
sphere x? + y? + z? = 1, we could proceed as follows: First decide whether the 
point of tangency is in the top or bottom hemisphere. Then apply equation (4) of 


Figure 2.70 A sphere andoneof 82.3 to the graph of z = y 1 — x? — y? or z = —,/1 — x? — y?, as appropriate. 
its tangent planes. The calculus is tedious but not conceptually difficult. However, the tangent planes 


to points on the equator are all vertical and so equation (4) of §2.3 does not apply. 
(It is possible to modify this approach to accommodate such points, but we will 
not do so.) In general, given a surface described by an equation of the form 
F(x, y, z) = c (where c is a constant), it may be entirely impractical to solve 
for z even as several functions of x and y. Try solving for z in the equation 
xyz + ye** — x? + yz? = 0 and you’ll see what we mean. We need some other 
way to get our hands on tangent planes to surfaces described as level sets of 
functions of three variables. 

To get started on our quest, we present the following result, interesting in its 
own right: 


THEOREM 6.4 Let X C R” be open and f: X — R be a function of class C!. 
If xo is a point on the level set S$ = {x € X | f(x) = c}, then the vector V f (Xo) 
is perpendicular to S. 
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PROOF We need to establish the following: If v is any vector tangent to S at xo, 
then V f (xo) is perpendicular to v (i.e., V f(xo)+ v = 0). By a tangent vector to $ 
at xo, we mean that v is the velocity vector of a curve C that lies in S and passes 
through Xo. The situation in R? is pictured in Figure 2.71. 


Figure 2.71 The level set surface 
S= {x | f(x) =c). 


Thus, let C be given parametrically by x(t) = (x1(t), x2(t), ..., Xn(t)), where 
a <t < band x(t)) = Xo for some number fg in (a, b). (Then, if v is the velocity 
vector at Xp, we must have x'(tọ) = v. See §3.1 for more about velocity vectors.) 
Since C is contained in S, we have 


SOO) = fA), x2(t), ..., Xn(t)) =. 


Hence, 


d d 
az PaO = 7 le] = 0. (4) 


On the other hand, the chain rule applied to the composite function f o x: 
(a, b) — R tells us 


d 
Er [f(x())] = VEA- xA. 
Evaluation at tọ and equation (4) let us conclude that 
V f (x(to)) + x'(to) = Vf (Xo) v = 0, 


as desired. E 


Here’s how we can use the result of Theorem 6.4 to find the plane tan- 
gent to the sphere x? + y? + z? = 1 at the point (-+. 0, 4). From §1.5, we 


know that a plane is determined uniquely from two pieces of information: (1) a 
point in the plane and (ii) a vector perpendicular to the plane. We are given a 


point in the plane in the form of the point of tangency (-+. 0, 4) As for 


a vector normal to the plane, Theorem 6.4 tells us that the gradient of the func- 
tion f(x, y, z) = x? + y? + z? that defines the sphere as a level set will do. We 
have 


V f(x,y,z) = 2xi+ 2yj + 2zk, 


so that 


vr( w“ =)= V2i+ V2k. 
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Hence, the equation of the tangent plane is 


erea) (e)a 


afaa) 


z- x= v2. 


or 


In general, if S is a surface in R? defined by an equation of the form 


fœ, y,z)=c, 


then if x9 € X, the gradient vector V f (Xo) is perpendicular to S and, conse- 
quently, if nonzero, is a vector normal to the plane tangent to S at xo. Thus, 
the equation 


V f (Xo) + (X — Xo) = 0 
or, equivalently, 


Fx (Xo, Yo, Zo)\(X — xo) + fy(xo, Yo. Zo — Yo) 


+ f(%0, yo, ZoX(z — Zo) = 0 


is an equation for the tangent plane to S at xo. 


Note that formula (5) can be used in R” as well as in R?, in which case it 
defines the tangent hyperplane to the hypersurface S$ C R” defined by f(x, 
X2,...,X,) = C at the point xp € S. 


EXAMPLE 5 Consider the surface S defined by the equation x? y — yz? + z5 = 
9. We calculate the plane tangent to S at the point (3, — 1, 2). 
To do this, we define f(x, y, z) = x°y — yz? + 2°. Then 


ViG)1)2)— Gey 262 = 292k) | 5 245, 


= —27i + 23j + 84k 


is normal to S at (3, —1, 2) by Theorem 6.4. Using formula (6), we see that the 
tangent plane has equation 


27(x — 3) + 23(y + 1) + 84(z — 2) =0 


or, equivalently, 
—27x + 23y + 84z = 64. + 


EXAMPLE 6 Consider the surface defined by zt = x? + y?. This surface is 
the level set (at height 0) of the function 


Tengara er 
The gradient of f is 
V f(x, y, Z) = 2xit+ 2yj — 42k. 


Figure 2.72 The 
surface of Example 6. 
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Note that the point (0, 0, 0) lies on the surface. However, V f(0, 0, 0) = 0, which 
makes the gradient vector unusable as a normal vector to a tangent plane. Thus, 
formula (6) doesn’t apply. What we conclude from this example is that the surface 
fails to have a tangent plane at the origin, a fact that is easy to believe from the 
graph. (See Figure 2.72.) + 


EXAMPLE 7 The equation x? + y? + z? + w? = 4 defines a hypersphere of 
radius 2 in R4. We use formula (5) to determine the hyperplane tangent to the 
hypersphere at (—1, 1, 1, —1). 
The hypersphere may be considered to be the level set at height 4 of the 
function f(x, y, z, w) = x? + y? + z? + w”, so that the gradient vector is 
VF, y, Z, w) = (2x, 2y, 2z, 2w), 
so that 
V f(-1, 1,1, -1) = (-2, 2, 2, —2). 
Using formula (5), we obtain an equation for the tangent hyperplane as 


(—2, 2,2, -2)-(*+1,y—l1,z-—lhw+l=0 


or 


2x + 1)+2(y — 1) + 2(2 - 1D -2(w+ 1) =0. 
Equivalently, we have the equation 
x-y-ztw+4=0. + 
EXAMPLE 8 We determine the plane tangent to the paraboloid z = x? + 3y? 
at the point (—2, 1, 7) in two ways: (i) by using formula (4) in §2.3, and (ii) by 
using our new formula (6). 
First, the equation z = x? + 3y? explicitly describes the paraboloid as the 
graph of the function f(x, y) = x? + 3y?, that is, by an equation of the form 


z = f(x, y). Therefore, formula (4) of §2.3 applies to tell us that the tangent 
plane at (—2, 1, 7) has equation 


z = f2, 1) + f2, D@ + 2) + H(-2, DO -= 1) 


or, equivalently, 


z =7— 4x +2) + 6y — 1). (7) 


Second, if we write the equation of the paraboloid as x? + 3y? — z = 0, 
then we see that it describes the paraboloid as the level set of height 0 of the 
three-variable function F(x, y, z) = x? + 3y* — z. Hence, formula (6) applies 
and indicates that an equation for the tangent plane at (—2, 1, 7) is 


F,(—2, 1, DŒ + 2) + Fy(=2, 1, DO = 1) + F-2,1, DG -7)=0 


or 


4(x +2)+6(y—1)—1I(z-7)=0. (8) 
As can be seen, equation (7) agrees with equation (8). + 
Example 8 may be viewed in a more general context. If S is the surface in R? 


given by the equation z = f(x, y) (where f is differentiable), then formula (4) of 
§2.3 tells us that an equation for the plane tangent to S at the point (a, b, f(a, bY) is 


z= f(a, b) + frla, bx — a) + fra, b) — b). 
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(1, 1, -2V3) 


Figure 2.73 The two-sheeted 
hyperboloid 27/4 — x? — y? = 1. 
The point (—2, 2, 6) lies on the 
sheet given by z = 2/x?+ y? + 1, 
and the point (1, 1, —2/3) lies on 
the sheet given by 


z= —2/x? + y? +1. 


At the same time, the equation for S may be written as 


fœ, y)—-z=0. 


Then, if we let F(x, y, z) = f(x, y) — z, we see that S is the level set of F at 
height 0. Hence, formula (6) tells us that the tangent plane at (a, b, f(a, b)) is 


F(a, b, f(a, b)(x — a) + Fy(a, b, f(a, bD) — b) 
+ F(a, b, f(a, bz — f(a, b)) = 0. 


By construction of F, 
OF _ of OF _ of OF 
dy Əy 


Thus, the tangent plane formula becomes 


f(a, b\(x — a) + f(a, by — b) — (z — f(a, b)) = 9. 
The last equation for the tangent plane is the same as the one given above by 
equation (4) of §2.3. 
The result shows that equations (5) and (6) extend the formula (4) of §2.3 to 
the more general setting of level sets. 


The Implicit Function and Inverse Function 
Theorems (optional) 


We have previously noted that not all surfaces that are described by equations of 
the form F(x, y, z) = c can be described by an equation of the form z = f(x, y). 
We close this section with a brief—but theoretically important—digression about 
when and how the level set {(x, y, z) | F(x, y, z) = c} can also be described as 
the graph of a function of two variables, that is, as the graph of z = f(x, y). 
We also consider the more general question of when we can solve a system of 
equations for some of the variables in terms of the others. 
We begin with an example. 


EXAMPLE 9 Consider the hyperboloid z?/4 — x? — y? = 1, which may be 
described as the level set (at height 1) of the function 


z 2 
Fœ yz) = 7 —% 


— y? A 

(See Figure 2.73.) This surface cannot be described as the graph of an equation 
ofthe form z = f(x, y), since particular values for x and y give rise to two values 
for z. Indeed, when we solve for z in terms of x and y, we find that there are two 
functional solutions: 


z=2yx? +y? +1 and z= —2yx? +y? +1. (9) 


On the other hand, these two solutions show that, given any particular point 
(xo, Yo, zo) of the hyperboloid, we may solve locally for z in terms of x and y. 
That is, we may identify on which sheet of the hyperboloid the point (xo, yo, Zo) 
lies and then use the appropriate expression in (9) to describe that sheet. + 


Example 9 prompts us to pose the following question: Given a surface S, 
described as the level set {(x, y, z) | F(x, y, z) = c}, can we always determine at 
least a portion of S as the graph ofa function z = f(x, y)? The result that follows, 
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a special case of what is known as the implicit function theorem, provides 
relatively mild hypotheses under which we can. 


THEOREM 6.5 (THE IMPLICIT FUNCTION THEOREM) Let F: X C R” > R be 
of class C! and let a be a point of the level set S = {x € R” | F(x) = c}. If 
F, (a) Æ 0, then there is a neighborhood U of (a1, a2,...,@n—1) in R""!, a 
neighborhood V of a, in R, and a function f:U C R"! —> V of class C! 
such that if (x1, x2,...,%,-1) E U and x, € V satisfy F(x1, X2, ... , Xn) = c(ie., 
(X1,X2,---,Xn) € S), then x, = f (X1, X2,.--,Xn—1)- 


The significance of Theorem 6.5 is that it tells us that near a point a € S 
such that dF /dx, Æ 0, the level set S given by the equation F(x),...,%,) = € 
is locally also the graph of a function x, = f(x), ...,X,—1). In other words, we 
may solve locally for x, in terms of x|,..., X,—1, so that S is, at least locally, a 
differentiable hypersurface in R”. 


EXAMPLE 10 Returning to Example 9, we recall that the hyperboloid is the 
level set (at height 1) of the function F(x, y, z) = z?/4 — x? — y?. We have 

OF z 

3z 2 
Note that for any point (xo, yo, zo) in the hyperboloid, we have |zo| > 2. Hence, 
ð F:-(xo, Yo, zo) Æ 0. Thus, Theorem 6.5 implies that we may describe a portion 
of the hyperboloid near any point as the graph of a function of two variables. This 
is consistent with what we observed in Example 9. + 


Of course, there is nothing special about solving for the particular vari- 
able x, in terms of x1, ...,Xn—-1. Suppose a is a point on the level set S de- 
termined by the equation F(x) = c and suppose V F(a) # 0. Then Fy, (a) 4 0 for 
some 7. Hence, we can solve locally near a for x; as a differentiable function of 
X1,...,Xj-1,Xj41,---,Xn. Therefore, S is locally a differentiable hypersurface 
in R”. 


EXAMPLE 11 Let S denote the ellipsoid x7/4 + y?/36 + z?/9 = 1. Then S 
is the level set (at height 1) of the function 


F(x E a ae 
ee 4 36. g 
At the point (/2, V6, 4/3), we have 
OF 2 243 
82 I, V5, V3) 9 JINEN) 9 


Thus, S may be realized near (/2, 6, V3) as the graph of an equation of the 
form z = f(x, y), namely, z = 3,/1 — x2/4 — y?/36. At the point (0, —6, 0), 
however, we see that ð F /ðz vanishes. On the other hand, 
OF 
ay 


_ 2y 
(0,—6,0) 36 


es 


(0,—6,0) 3 


Consequently, near (0, —6, 0), the ellipsoid may be described by solving for y as 
a function of x and z, namely, y = —6,/1 — x? /4 — z? /9. + 
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EXAMPLE 12 Consider the set of points S defined by the equation x*z* — y = 
0. Then S is the level set at height 0 of the function F(x, y, z) = x?z? — y. Note 
that 


V F(x, y, z) = (2xz”, —1, 2x*z). 


Since 0 F/ðy never vanishes, we see that we can always solve for y as a function 
of x and z. (This is, of course, obvious from the equation.) On the other hand, near 
points where x and z are nonzero, both 0 F'/dx and ð F/ðz are nonzero. Hence, 
we can solve for either x or z in this case. For example, near (1, 1, —1), we have 


i= 2 and z= -— = ? 
yz a 


As just mentioned, Theorem 6.5 is actually a special case of a more general 
result. In Theorem 6.5 we are attempting to solve the equation 


F(x), %2,...,%n) =C 
for x, in terms of x1,...,X,—1. In the general case, we have a system of m 
equations 
F(x, aneri, s Xn, yi; E Ym) = C1 
ee R , (10) 
Fin(X1, see Xns Viseees Ym) = Cm 
and we desire to solve the system for y1, . . . , Ym in terms of x1, . . . , Xn. Using vec- 
tor notation, we can also write this system as F(x, y) = c, where x = (x1, ..., Xn), 
y = (Yi; ---, Ym) C= (C1, ---, Cm), and Fi,..., Fm make up the component 


functions of F. With this notation, the general result is the following: 


THEOREM 6.6 (THE IMPLICIT FUNCTION THEOREM, GENERAL CASE) Suppose 
F: A — R” is of class C!, where A is open in R”™, Let (a, b) = (a1, . . - , an, 
bi, ..., bm) € A satisfy F(a, b) = c. If the determinant 


OF OF; 
=—(a,b) --+ ——(a,b) 
Oy, dYm 
A(a, b) = det : E : Æ 0, 
a Fm OF m 
(a, b) ER ——(a, b) 
dy) Ym 


then there is a neighborhood U of a in R” and a unique function f: U —> R” of 
class C! such that f(a) = b and F(x, f(x)) = ¢ for all x € U. In other words, we 
can solve locally for y as a function f(x). 


EXAMPLE 13 We show that, near the point (x1, x2, x3, Y1, Y2) = (—1, 1,1, 
2, 1), we can solve the system 
X1y2 + x271 = 1 
(11) 


xix3y1 + x2y3 = 3 


for yı and yz in terms of x1, x2, x3. 
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We apply the general implicit function theorem (Theorem 6.6) to the system 
F\(%1, x2, X3, y1, Y2) = x1 y2 + X21 = 1 
Fy(x1, X2, X3, Yi, Y2) = x?x3y1 + x2yZ = 3 


The relevant determinant is 


OF, OF, 
A(-1,1,1,2,1) = det| ° 9% 
OF, OF) 
a | 
L 7i %2 (%1,%2,.%3,91,92)=(—1,1,1,2,1) 
X2 x} 
= det 
Pre 3x2y5 


(41,%2,.%3,91,92)=(-1,1,1,2,1) 


1 =l 
= et] | , |=440 


Hence, we may solve locally, at least in principle. 

We can also use the equations in (11) to determine, for example, 
dy2 : . 
Sa, 1, 1), where we treat x1, x2, x3 as independent variables and yı and 

Xi 

y2 as functions of them. 

Differentiating the equations in (11) implicitly with respect to xı and using 
the chain rule, we obtain 


gin rr SO 
= "ax 23x 
a) 0 
2x1x3y1 4 x?x3 A | 3x272 2 = 0 
Ox] Ox 


Now, let (x1, x2, x3, Y1, Y2) = (—1, 1, 1, 2, 1), so that the system becomes 


3 3 
i ys eiai 
Ox] Ox] 


ð ð Í 
Di1,1, )+321,1,1)=4 
Ox] Ox) 


: . ə 5 
We may easily solve this last system to find that F 1t T)= 7 + 
X] 


Now, suppose we have a system of n equations that defines the variables 
Y1, -.-, Yn in terms of the variables x1, ... , Xn, that is, 


y= Sila, ---5 Xn) 


Y2 = fats -ees Xn) (12) 


Yn = Saxi, 1.’ Xn) 


Note that the system given in (12) can be written in vector form as y = f(x). The 
question we ask is, when can we invert this system? In other words, when can we 
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solve for x1, ...,X, in terms of yj,..., Yn, or, equivalently, when can we find a 
function g so that x = g(y)? 
The solution is to apply Theorem 6.6 to the system 


Filis sssi tag iy soos Ve) =O 
Fo(X1,---5 Xn Yiee Yn) = 0 


Fn (x1, sawa Me Visnes Ya) =O 
where F;(x1,..-,Xn,V1,--+>¥n) = fiX, ---; Xn) — yi. (In vector form, we are 
setting F(x, y) = f(x) — y.) Then solvability for x in terms of y near x = a, y = b 
is governed by the nonvanishing of the determinant 


afi afi 
ax, 3x, (a) 
det Df(a) = det : ES : : 
dfn Ofn 
| Fe a | 
This determinant is also denoted by 
afi. e.r’ fa) 
O(Xis 6259 Xn) lia 
and is called the Jacobian of f = (fi, ..., fn). A more precise and complete 


statement of what we are observing is the following: 


THEOREM 6.7 (THE INVERSE FUNCTION THEOREM) Suppose f = (fi, ..., fn) 
is of class C! on an open set A C R”. If 
(fiss Sn 
det Df(a) = Opts fn) # 0, 


Ilkin Xn) ley 


then there is an open set U C R” containing a such that f is one-one on U, the 
set V = f(U) is also open, and there is a uniquely determined inverse function 
g: V — U tof, which is also of class C!. In other words, the system of equations 
y = f(x) may be solved uniquely as x = g(y) for x near a and y near b. 


EXAMPLE 14 Consider the equations that relate polar and Cartesian coordi- 
nates: 


x =rcosé 
y=rsinð ` 


These equations define x and y as functions of r and 0. We use Theorem 6.7 to 
see near which points of the plane we can invert these equations, that is, solve for 
r and @ in terms of x and y. 

To use Theorem 6.7, we compute the Jacobian 


ax, y) 
a(r,0) 


sind rcosé 


cos -~r sin | 


Thus, we see that, away from the origin (r = 0), we can solve (locally) for r and 6 
uniquely in terms of x and y. At the origin, however, the inverse function theorem 
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does not apply. Geometrically, this makes perfect sense, since at the origin the 
polar angle 0 can have any value. + 


2.6 Exercises 


1. Suppose f(x, y, z) is a differentiable function of three 
variables. 
(a) Explain what the quantity V f(x, y, z) *(—k) rep- 
resents. 
(b) How does V f(x, y, z)*(—k) relate to 0f/0z? 


In Exercises 2-8, calculate the directional derivative of the 
given function f at the point a in the direction parallel to the 
vector U. 


3-4 
2. f(x, y) =e! sinx, a = (Z, 0), u = a4 


3 10 
i+ 2j 
3. f(x,y) =x? —2x3y +2y3,a=(2,-1,u= - 
fœ, y) y+2y ( ) T 
4. fæ, y) = G,-2),u=i-j 
. f(x,y)= ,a= G6, ,u=i 
"= EFA i 
5. f(x, y) = e — x?y,a = (1,2), u = 2i + j 
2k—i 
6. f(x,y,z) = xyz, a = (—1, 0, 2), u = 
fœ, y, z) = xy ) T 
7. f(x,y, z) = et a= (1,2,3, u=i+j+k 
E T E (2, -1, 0) i—2j+ 
«JX, = >, a= (2, —1, U0), u=1— 
z E321 i 


9. For the function 


xly| 
{BVH ive rr 


0 if (x, y) = (0, 0) 


if (x, y) # (0, 0) 


(a) calculate f,(0, 0) and f,(0, 0). (You will need to 
use the definition of the partial derivative.) 


(b) use Definition 6.1 to determine for which unit 
vectors v= vi + wj the directional derivative 
Dy f (0, 0) exists. 


V (c) use a computer to graph the surface z = f(x, y). 


10. For the function 


hess re if (x, y) #0, 0) 
x,y= J , 
0 if (x, y) = (0, 0) 


(a) calculate f,(0, 0) and f,(0, 0). 


(b) use Definition 6.1 to determine for which unit 
vectors v= vi+ wj the directional derivative 
Dy f (0, 0) exists. 


Q (c) use a computer to graph the surface z = f(x, y). 


11. The surface of Lake Erehwon can be represented by a 
region D in the xy-plane such that the lake’s depth (in 
meters) at the point (x, y) is given by the expression 
400 — 3x?y?. If your calculus instructor is in the wa- 
ter at the point (1, —2), in which direction should she 
swim 
(a) so that the depth increases most rapidly (i.e., so 

that she is most likely to drown)? 


(b) so that the depth remains constant? 


12. A ladybug (who is very sensitive to temperature) is 
crawling on graph paper. She is at the point (3, 7) and 
notices that if she moves in the i-direction, the tem- 
perature increases at a rate of 3 deg/cm. If she moves 
in the j-direction, she finds that her temperature de- 
creases at arate of 2 deg/cm. In what direction should 
the ladybug move if 
(a) she wants to warm up most rapidly? 

(b) she wants to cool off most rapidly? 


(c) she desires her temperature not to change? 


13. You are atop Mt. Gradient, 5000 ft above sea level, 
equipped with the topographic map shown in Fig- 
ure 2.74. A storm suddenly begins to blow, necessitat- 
ing your immediate return home. If you begin heading 
due east from the top of the mountain, sketch the path 
that will take you down to sea level most rapidly. 


14. It is raining and rainwater is running off an ellipsoidal 
dome with equation 4x? + y? +42? = 16, where 
z > 0. Given that gravity will cause the raindrops to 
slide down the dome as rapidly as possible, describe 
the curves whose paths the raindrops must follow. 
(Hint: You will need to solve a simple differential 
equation.) 


15. Igor, the inchworm, is crawling along graph paper in 
a magnetic field. The intensity of the field at the point 
(x, y) is given by M(x, y) = 3x7 + y? + 5000. If Igor 
is at the point (8, 6), describe the curve along which he 
should travel if he wishes to reduce the field intensity 
as rapidly as possible. 


In Exercises 16-19, find an equation for the tangent plane to 
the surface given by the equation at the indicated point (xo, yo, 
zo). 
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16. 
17. 
18. 
19. 
20. 


21. 


22. 


23. 


24. 


25. 
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You are here 


Figure 2.74 The topographic map of Mt. Gradient in Exercise 13. 


x? + y? +23 = 7, (xo, Yo, Zo) = (0, —1, 2) 

ze” cosx = 1, (xo, yo, zo) = (77, 0, —1) 

2xz + yz — x°y + 10 = 0, (xo, yo, Zo) = (1, —5, 5) 

2xy? = 2z? = xyz, (xo, Yo, Z0) = (2, —3, 3) 

Calculate the plane tangent to the surface whose equa- 

tion is x? — 2y? + 5xz = 7 at the point (—1, 0, —§) in 

two ways: 

(a) by solving for z in terms of x and y and using 
formula (4) in §2.3 

(b) by using formula (6) in this section. 


Calculate the plane tangent to the surface x sin y + 

xz? = 2e” at the point (2, 7, 0) in two ways: 

(a) by solving for x in terms of y and z and using a 
variant of formula (4) in §2.3 


(b) by using formula (6) in this section. 
Find the point on the surface x° — 2y? +z? = 27 
where the tangent plane is perpendicular to the line 


given parametrically as x = 3t — 5, y = 2t +7, z = 
1— V2. 


Find the points on the hyperboloid 9x? — 45y? + 
5z* = 45 where the tangent plane is parallel to the 
plane x + 5y — 2z = 7. 


Show that the surfaces z = 7x? — 12x — 5y? and 
xyz? = 2 intersect orthogonally at the point (2, 1, —1). 


Suppose that two surfaces are given by the equations 


F(x,y,z)=c and G, y,z)=k. 


Moreover, suppose that these surfaces intersect at the 
point (xo, yo, Zo). Show that the surfaces are tangent at 
(xo, Yo, Zo) if and only if 


VF (xo, Yo. zo) X VG(xo, Yo, Zo) = 9. 


26. Let S denote the cone x? + 4y? = z?. 


(a) Find an equation for the plane tangent to S at the 
point (3, —2, —5). 

(b) What happens if you try to find an equation for a 
tangent plane to S at the origin? Discuss how your 
findings relate to the appearance of S. 


27. Consider the surface S defined by the equation x? — 


ryt+2=0. 

(a) Find an equation for the plane tangent to S at the 
point (2, —3/2, 1). 

(b) Does S have a tangent plane at the origin? Why or 
why not? 


Ifa curve is given by an equation of the form f(x, y) = 0, then 
the tangent line to the curve at a given point (xo, yo) on it may 
be found in two ways: (a) by using the technique of implicit 
differentiation from single-variable calculus and (b) by using 
a formula analogous to formula (6). In Exercises 28—30, use 
both of these methods to find the lines tangent to the given 
curves at the indicated points. 


28. x? + y? = 4, (xo, yo) = (— 72, V2) 
29. y? = x? + x°, (xo, yo) = (1, V2) 
30. x° + 2xy + y? = 16, (xo, yo) = (2, —2) 


Let C be a curve in R? given by an equation of the form 
f(x, y) = 0. The normal line to C at a point (xo, yo) on it 
is the line that passes through (xo, yo) and is perpendicular 
to C (meaning that it is perpendicular to the tangent line to 
C at (xo, yo)). In Exercises 31—33, find the normal lines to 
the given curves at the indicated points. Give both a set of 
parametric equations for the lines and an equation in the form 
Ax + By = C. (Hint: Use gradients.) 


31. 
32. 
33. 
34. 


35. 


36. 


37. 


38. 


39. 


x? — y? = 9, (xo, yo) = (5, —4) 
x? = x? = y?, (xo, yo) = (—1, V2) 
x? —2xy + y% = 11, (xo, yo) = (2, —1) 


This problem concerns the surface defined by the equa- 
tion 


xz + ry + sin (yz) = —3. 


(a) Find an equation for the plane tangent to this sur- 
face at the point (—1, 0, 3). 

(b) The normal line to a surface S in R? at a point 
(xo, Yo, Zo) on it is the line that passes through 
(xo, Yo. Zo) and is perpendicular to S. Find a set 
of parametric equations for the line normal to the 
surface given above at the point (—1, 0, 3). 


Give a set of parametric equations for the normal line to 
the surface defined by the equation e*” + e** — 2e” = 
0 at the point (—1, —1, —1). (See Exercise 34.) 


Give a general formula for parametric equations for 
the normal line to a surface given by the equation 
F(x, y, z) = 0 at the point (xo, yo, Zo) on the surface. 
(See Exercise 34.) 


Generalizing upon the techniques of this section, 
find an equation for the hyperplane tangent to 
the hypersurface sinxı + cos x2 + sin x3 + cos x4 + 
sinxs = — 1 at the point (x, x, 37/2, 27, 277) € RS. 
Find an equation for the hyperplane tangent to the 
(n — 1)-dimensional ellipsoid 
1 

x? + 2x2 + 3x + oo nx = ntd 

at the point (—1, —1,..., —1) € R”. 


Find an equation for the tangent hyperplane to the (n — 
1)-dimensional sphere x? + x? +- --+x2 = 1 in R” 


at the point (1/./n, 1/vn, ..., 1/./n, —1/vn). 


Exercises 40-49 concern the implicit function theorems and 
the inverse function theorem (Theorems 6.5, 6.6, and 6.7). 


40. 


Let S be described by z? y? + x?y = 2. 

(a) Use the implicit function theorem to determine 
near which points S can be described locally as 
the graph of a C! function z = f(x, y). 


41. 
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(b) Near which points can S be described (locally) as 
the graph of a function x = g(y, z)? 

(c) Near which points can S be described (locally) as 
the graph of a function y = h(x, z)? 

Let S be the set of points described by the equation 

sinxy +e% +x7y = l. 

(a) Near which points can we describe S as the graph 
ofa C! function z = f(x, y)? What is f(x, y) in 
this case? 

(b) Describe the set of “bad” points of S, that is, the 


points (xo, yo, zo) € S where we cannot describe 
S as the graph of a function z = f(x, y). 


Q (c) Use a computer to help give a complete picture of 
S. 


42. 


43. 


44. 


45. 


46. 


Let F(x, y) =c define a curve C in R?. Suppose 
(xo, yo) isa point of C such that V F (xo, yo) Æ 0. Show 
that the curve can be represented near (xo, yo) as either 
the graph of a function y = f(x) or the graph of a 
function x = g(y). 


Let F(x, y) = x? — y°, and consider the curve C de- 
fined by the equation F(x, y) = 0. 


(a) Show that (0, 0) lies on C and that F,(0, 0) = 0. 

(b) Can we describe C as the graph of a function 
y = f(x)? Graph C. 

(c) Comment on the results of parts (a) and (b) in light 
of the implicit function theorem (Theorem 6.5). 


(a) Consider the family of level sets of the function 
F(x, y) = xy + 1. Use the implicit function theo- 
rem to identify which level sets of this family are 
actually unions of smooth curves in R? (i.e., locally 
graphs of C! functions of a single variable). 


(b) Now consider the family of level sets of 


F(x, y,z)= xyz + 1. Which level sets of this 
family are unions of smooth surfaces in R3? 


Suppose that F(u, v) is of class C! and is such that 

F(—2, 1) = 0 and F,(—2, 1) = 7, F,(—2, 1) = 5. Let 

G(x, y, 2) = F(x? — 2y? + 2°, xy — x7z + 3). 

(a) Check that G(—1, 1, 1) = 0. 

(b) Show that we can solve the equation G(x, y, z) = 
0 for z in terms of x and y (1.e., as z = g(x, y), for 
(x, y) near (—1, 1) so that g(—1, 1) = 1). 


Can you solve 


X2y2 — xı cosy; = 5 
x2 sin yı + X1y2 = 2 


for yı, y2 as functions of xı, x2 near the point 
(x1, x2, Y1, Y2) = (2,3, 2,1)? What about near the 
point (x1, x2, Y1, y2) = (0, 2, 7/2, 5/2)? 
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47. Consider the system 


xyi — 2x3 = 1 
xyi + Xoy2 — 4yay3 = —9. 
X271 + 3x1y3 = 12 


(a) Near which points of R? can we solve for r, 6, and 
z in terms of the Cartesian coordinates? 


(b) Explain the geometry behind your answer in 
part (a). 
49. Recall that the equations relating spherical and Carte- 


(a) Show that, near the point (x1, x2, Y1, Y2, Y3) = sian coordinates in R? are 


(1, 0, —1, 1, 2), it is possible to solve for y1, y2, 


y3 in terms of x1, x2. 


(b) From the result of part (a), we may consider y1, yo, 


x = p sing cos 


y = psingsinð. 


y; to be functions of xı and x2. Use implicit differ- Z = p COS Q 


T f əy1 
entiation and the chain rule to evaluate aa , 0), (a) Near which points of R? can we solve for p, g, and 
1 


dy2 


dy3 
— (1, 0), and — (1, 0). 
Dx | ), an ax: ) 


6 in terms of x, y, and z? 
(b) Describe the geometry behind your answer in 
part (a). 


48. Consider the equations that relate cylindrical and 


Cartesian coordinates in R3: 


x =rcosé 
y =rsinð. 
Z=Z 


Figure 2.75 The tangent line to 
y = f(x) at (xo, f(xo)) crosses the 
X-axis at x = x]. 


2.7 Newton’s Method (optional) 


When you studied single-variable calculus, you may have learned a method, known 
as Newton’s method (or the Newton—Raphson method), for approximating the 
solution to an equation of the form f(x) = 0, where f: X C R —> R is a differ- 
entiable function. Here’s a reminder of how the method works. 

We wish to find a number r such that f(r) = 0. To approximate r, we make 
an initial guess xo for r and, in general, we expect to find that f (xo) 4 0. So next 
we look at the tangent line to the graph of f at (xo, f (xo)). (See Figure 2.75.) 
Since the tangent line approximates the graph of f near (xo, f(xo)), we can 
find where the tangent line crosses the x-axis. The crossing point (xı, 0) will 
generally be closer to (r, 0) than (xo, 0) is, so we take x; as a revised and improved 
approximation to the root r of f(x) = 0. 

To find xı, we begin with the equation of the tangent line 


y = f (xo) + fox — Xo), 

then set y = 0 to find where this line crosses the x-axis. Thus, we solve the equation 
f (x0) + f’ Go) — xo) = 0 

for x, to find that 

ic 

f'(Xo) 
Once we have xı, we can start the process again using xı in place of xo and 
produce what we hope will be an even better approximation x, via the formula 


X1 = X0 


fœ 
X2 = X1 — ET AE 
fœ) 
Indeed, we may iterate this process and define x, recursively by 
pag a. k= l ee (1) 
Sf! (Xk-1) 


and thereby produce a sequence of numbers xo, %1,..., Xk, -+ 
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It is not always the case that the sequence {xg} converges. However, when 
it does, it must converge to a root of the equation f(x) = 0. To see this, let 
L = lim; œ xz. Then we also have limy_,.. x,_1 = L. Taking limits in formula 
(1), we find 


fD) 
FLY 
which immediately implies that f (L) = 0. Hence, L is a root of the equation. 
Now that we have some understanding of derivatives in the multivariable 


case, we turn to the generalization of Newton’s method for solving systems of n 
equations in n unknowns. We may write such a system as 


L=L 


Fils eX) = 0 
PQs 2p %q) =O 

(2) 
FilXinnsig My) =O 


We consider the map f: X C R” — R” defined as f(x) = (f1), ..., fn(X)) (Le., 
f is the map whose component functions come from the equations in (2). The 
domain X of f may be taken to be the set where all the component functions are 
defined.) Then to solve system (2) means to find a vector r = (r1, ..., Fn) such 
that f(r) = 0. To approximate such a vector r, we may, as in the single-variable 
case, make an initial guess xq for what r might be. If f is differentiable, then we 
know that y = f(x) is approximated by the equation 


y = f(x) + Df(xo)(x — Xo). 


(Here we think of f(xo) and the vectors x and xo as n x 1 matrices.) Then we set 
y equal to 0 to find where this approximating function is zero. Thus, we solve the 
matrix equation 


fo) + Df(x0)(x1 — Xo) = 0 (3) 
for x; to give a revised approximation to the root r. Evidently (3) is equivalent to 
D£(X0)(X1 — Xo) = —f(x0). (4) 


To continue our argument, suppose that Df(xq) is an invertible n x n matrix, 
meaning that there is a second n x n matrix [Df(xo)]~! with the property that 
[Df(xo)]~! Df(xo) = Df(x9)[Df(xo)]~! = I, the n x n identity matrix. (See Ex- 
ercises 20 and 30-38 in §1.6.) Then we may multiply equation (4) on the left by 
[Df(xo)]~! to obtain 
In(X1 — Xo) = —[Dfx0)] ' fx0). 

Since [,A = A for any n x k matrix A, this last equation implies that 

xı = Xo — [Df(xo)]'f(x0). (5) 


As we did in the one-variable case of Newton’s method, we may iterate formula 
(5) to define recursively a sequence {x4} of vectors by 


Xk = X1 — [DE] fE) 
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Note the similarity between formulas (1) and (6). Moreover, just as in the case 
of formula (1), although the sequence {xo, X1,..., Xx, ...} may not converge, if 
it does, it must converge to a root of f(x) = 0. (See Exercise 4.) 


EXAMPLE 1 Consider the problem of finding the intersection points of the cir- 
cle x? + y? = 4and the hyperbola 4x? — y? = 4. (See Figure 2.76.) Analytically, 
we seek simultaneous solutions to the two equations 


x+y=4 and 4x?-y'=4, 


or, equivalently, solutions to the system 


Figure 2.76 Finding 
the intersection | 


x? +y —4=0 
4x? — y? —4=0 ` 


points of the circle 
x? + y? = 4 and 
the hyperbola 

4x? — y? = 4 in 
Example 1. 


(7) 


To use Newton’s method, we define a function f: R? —> R? by f(x, y) = (x? + 
y? — 4, 4x? — y? — 4) and try to approximate solutions to the vector equation 
f(x, y) = (0, 0). We may begin with any initial guess, say, 


and then produce successive approximations x, X2,... to a solution using for- 
mula (6). In particular, we have 


2x 2y 
Note that det Df(x, y) = —20xy. You may verify (see Exercise 36 in §1.6) that 
1 l 
_ 1 —2y —2y 10x 10x 
Df "= = 
[ (x; y)] —20xy | —8x 2x | 2 E S 
5y 10y 


| rs | 
SoS 
Yr FS 
Ls 
ll 


Dt ae 
| ‘ i = [Die y1] has Y1) 
Yk—-1 


1 1 
ke | 1Oxk-i 10x14 | x tha 4 | 


II 


2 1 


_ Axe | = Yet =4 
SYk-1 10yz—ı 


5x7_,—8 PEOR 5x? -8 
= | = 10x,_1 = B 10x1 
Yk—ı 5yż; -12 Sy? -12 


10yx—1 


2.7 | Newton’s Method (optional) 179 


Beginning with xo = yo = 1, we have 


51? —8 514512 
=]—-—___ =13 =1-——— “=17 
a 10-1 A 10-1 
4(1.3)° — 8 51.7? — 12 
psi- oes pas ON 
10(1.3) 10(1.7) 


= 1.555882, etc. 


It is also easy to hand off the details of the computation to a calculator or a 
computer. One finds the following results: 


Xk Yk 


1 1 

1.3 1:7 
1.26538462 | 1.55588235 
1.26491115 | 1.54920772 
1.26491106 | 1.54919334 
1.26491106 | 1.54919334 


ABWNrF CO > 


Thus, it appears that, to eight decimal places, an intersection point of the curves 
is (1.26491106, 1.54919334). 
In this particular example, it is not difficult to find the solutions to (7) exactly. 
We add the two equations in (7) to obtain 
5x°-8=0 > x=), 


Thus, x = +,/8/5. If we substitute these values for x into the first equation of 
(7), we obtain 


$+y°-4=0 = ye=?t. 


Hence, y = +./12/5. Therefore, the four intersection points are 


E [12 ie [12 1c [12 fe [12 
5V5’ 5V5’ 5 5J’ a 5J 
Since /8/5 © 1.264911064 and ~12/5 ~ 1.54919334, we see that Newton’s 
method provided us with an accurate approximate solution very quickly. + 
EXAMPLE 2 We use Newton’s method to find solutions to the system 


x? — 5x? +2x—-y+13=0 
| 4 (8) 


x? +x? — l4x—y—19=0 ` 


Asin the previous example, we define f: R? —> R? by f(x, y) = (x? — 5x? + 2x — 
y + 13, x? +x? — 14x — y — 19). Then 


3x? — 10x +2 1 
Die) =| a ya a ak 
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so that det Df(x, y) = 12x — 16 and 


1 1 
Tax — 16 12x — 16 
[Df(x, y)] -3x2 —2x +14 —3x? — 10x +2 
12x — 16 12x — 16 | 


Thus, formula (6) becomes 


1 1 
Xk] _ Xk-1 7 12x,_1 — 16 12x,_1 — 16 | 
y| Lye- i 3x; — 2x1 + 14 | 


12x1 — 16 12x,_1 — 16 


5x2; + 2xK-1 


xe ye-1 +13 


x 
J +x — 1444-1) — Yk=1i = 19 


6x?_; — 16x1 — 32 
12x,_; — 16 
3xf¢_, — 16xp_, — 14x2_, + 82xn-1 — 8yk—-1 + 6xXk—-1yk-1 + 72 
6xp_-1 — 8 


Xk-1 


Vk-1 


This is the formula we iterate to obtain approximate solutions to (8). 
If we begin with xp = (xo, yo) = (8, 10), then the successive approximations 
x; quickly converge to (4, 5), as demonstrated in the table below. 


Xk 


Yk 


DNnRWNrK CO > 


8 
5.2 
4.1862069 
4.00607686 
4.00000691 
4.00000000 
4.00000000 


10 
—98.2 
—2.7412414 
4.82161865 
4.99981073 
5.00000000 
5.00000000 


If we begin instead with xp = (50, 60), then convergence is, as you might predict, 


somewhat slower (although still quite rapid): 


OMAAINDMNBWNK OC > 


Xk 


50 
25.739726 
13.682211 

7.79569757 
5.11470969 
4.1643023 
4.00476785 
4.00000425 
4.00000000 
4.00000000 


Yk 


60 
—57257.438 
—7080.8238 
—846.58548 
—86.660453 
—1.6486813 
4.86119425 
4.99988349 
5.00000000 
5.00000000 
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On the other hand, if we begin with xp = (—2, 12), then the sequence of points 
generated converges to a different solution, namely, (—4/3, —25/27): 


k Xk Yk 
0 —2 12 
1 —1.4 1.4 
2 | —1.3341463 —0.903122 
3 | —1.3333335 | —0.9259225 
4 | —1.3333333 | —0.9259259 
5 | —1.3333333 | —0.9259259 


In fact, when a system of equations has multiple solutions, it is not always 
easy to predict to which solution a given starting vector xg will converge under 
Newton’s method (if, indeed, there is convergence at all). + 


Finally, we make two remarks. First, if at any stage of the iteration process the 
matrix Df(x;) fails to be invertible (i.e., [Df(x;,)]~! does not exist), then formula 
(6) cannot be used. One way to salvage the situation is to make a different choice 
of initial vector xo in the hope that the sequence {x4} that it generates will not 
involve any noninvertible matrices. Second, we note that if, at any stage, x, is 
exactly a root of f(x) = 0, then formula (6) will not change it. (See Exercise 7). 


2.7 Exercises 


Ti 1. Use Newton’s method with initial vector xọ = (1, —1) 
to approximate the real solution to the system 


yer =3 
2ye* + 10y* = 0 


2. In this problem, you will use Newton’s method to 
estimate the locations of the points of intersection 
of the ellipses having equations 3x? + y? =7 and 
x? 4+ 4y? = 8. 

(a) Graph the ellipses and use your graph to give a very 
rough estimate (xo, yo) of the point of intersection 
that lies in the first quadrant. 


(b) Denote the exact point of intersection in the first 
quadrant by (X, Y). Without solving, argue that 
the other points of intersection must be (—X, Y), 
(X, —Y), and (—X, —Y). 

Q (c) Now use Newton’s method with your estimate 
(xo, Yo) in part (a) to approximate the first quadrant 
intersection point (X, Y). 

(d) Solve for the intersection points exactly, and com- 
pare your answer with your approximations. 

3. This problem concerns the determination of the points 
of intersection of the two curves with equations x? — 
4y? = Land x? + 4y? =2. 


eT (a) Graph the curves and use your graph to give rough 
estimates for the points of intersection. 


@ (b) Now use Newton’s method with different initial 
estimates to approximate the intersection points. 


4. Consider the sequence of vectors xo, X1, ..., where, 
for k > 1, the vector x, is defined by the Newton’s 
method recursion formula (6) given an initial “guess” 
Xo at a root of the equation f(x) = 0. (Here we as- 
sume that f: X C R” > R” is a differentiable func- 
tion.) By imitating the argument in the single-variable 
case, show that if the sequence {x,} converges to a vec- 
tor L and Df(L) is an invertible matrix, then L must 
satisfy f(L) = 0. 


5. This problem concerns the Newton’s method iteration 
in Example 1. 


P (a) Use initial vector x9 = (—1, 1) and calculate the 
successive approximations X1, X2, X3, etc. To what 
solution of the system of equations (7) do the ap- 
proximations converge? 


KTI (b) Repeat part (a) with xo = (1, —1). Repeat again 
with x9 = (—1, —1). 
(c) Comment on the results of parts (a) and (b) and 


whether you might have predicted them. Describe 
the results in terms of Figure 2.76. 
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8. Suppose that f: X C R? > R? is differentiable and 
that we write f(x, y) = (f(x, y), g(x, y)). Show that 
formula (6) implies that, for k > 1, 

f (&k-1, Vk) Sy Kk-1 Ve-1) — 8(Xk-1, Yk-1)fy(X%k-1, Yk-1) 
Si (Xk=15 Yk—1)8y(Xk-1, Yk—1) — fy k-11, Yk-1)8x(Xk-1, Yk-1) 


Xk = Xk-1 


B(XK—15 Vk—-1) fo %k—-1 Yk-1) — J (Xk-1, Yk-1)8x(Xk-1, Yk-1) 


Vk = Yk-1 


6. Consider the Newton’s method iteration in Example 2. 

È (a) Use initial vector xo = (1.4, 10) and calculate the 
successive approximations X1, X2, X3, etc. To what 
solution of the system of equations (8) do the ap- 
proximations converge? 


Q (b) Repeat part (a) with xo = (1.3, 10). 


(c) In Example 2 we saw that (4, 5) was a solution of 
the given system of equations. Is (1.3, 10) closer to 
(4, 5) or to the limiting point of the sequence you 
calculated in part (b)? 


(d) Comment on your observations in part (c). What 
do these observations suggest about how easily you 
can use the initial vector xo to predict the value of 
limk oo Xx (assuming that the limit exists)? 


7. Suppose that at some stage in the Newton’s method it- 
eration using formula (6), we obtain a vector x, that is 
an exact solution to the system of equations (2). Show 
that all the subsequent vectors x41, Xk+2, . . . are equal 
to x. Hence, if we happen to obtain an exact root via 
Newton’s method, we will retain it. 


Fexri; Yk-1)8y(Xk-1; Yea) — Fy emis Yk-1)8x(Xk-1; Yk—1) 


ap 9. As we will see in Chapter 4, when looking for maxima 


True/False Exercises for Chapter 2 


1. The component functions of a vector-valued function 
are vectors. 


8. 


and minima of a differentiable function F: X C R” > 
R, we need to find the points where DF (x1, ..., Xn) = 
[0 --- 0J, called critical points of F. Let F(x, y) = 
4sin(xy) + x? + y>. Use Newton’s method to approx- 
imate the critical point that lies near (x, y) = (—1, —1). 


10. Consider the problem of finding the intersection points 

of the sphere x? + y? + z? = 4, the circular cylinder 

x? + y? = 1, and the elliptical cylinder 4y? + z? = 4. 

eT (a) Use Newton’s method to find one of the intersec- 
tion points. By choosing a different initial vector 
Xo = (xo, Yo, Zo), approximate a second intersec- 
tion point. (Note: You may wish to use a computer 
algebra system to determine appropriate inverse 
matrices.) 


(b) 


Find all the intersection points exactly by means of 
algebra and compare with your results in part (a). 


The graph of any function of two variables is a level set of 
a function of three variables. 


2. The domain of f(x, y) = (x? +y? +1, 3 ae is 9. The level set of any function of three variables is the graph 
x+y y of a function of two variables. 
(y) ER? |y #0, x #y} x2 = 2y? 
10. lim yer ee =l, 
3 x (x,y)>(0,0) x+y 
3. The range of f(x, y) = (x a a z) is 3 7 
x+y y y =x h 0.0 
{(u, v, w) € RÈ | u > 1}. 11. E fæ y= xy ™ en (x, y) # (0,0) then f is 
4. The function f: R? — {(0, 0, 0)} > R3, f(x) = 2x/||x|| , 2 when (x, y) = (0, 0) 
; continuous. 
is one-one. 
12. If f(x,y) approaches a number L as (x, y) > (a,b) 


5. The graph of x = 9y? + 27/4 is a paraboloid. 
6. The graph of z + x? = y? is a hyperboloid. 


7. The level set of a function f(x, y, z) is either empty 
or a surface. 


13. 


along all lines through (a, b), then lim, y)~(a,p) f(x, y) 
= Lh, 


If limy_,, f(x) exists and is finite, then f is continuous 
at a. 
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n f(x, b) — fla, b) 24. The tangent plane to z = x?/(y + 1) at the point 
ae: f(a, b) = lim x-a f (—2, 0, —8) has equation z = 12x + 8y + 16. 
15. If f(x, y, z) = sin y, then V f(x, y, z) = cos y. 25. The plane tangent to xy/z? = 1 at (2, 8, —4) has equa- 
16. If f: R? > Rt is differentiable, then Df(x) is a 3 x 4 tion 4x + y +2z = 8. 
matrix. 26. The plane tangent to the surface x? + xye + y? = 
17. Iff is differentiable at a, then f is continuous at a. . E vi a Be pO) As parallel to nthe vector 
18. Iff is continuous at a, then f is differentiable at a. 
27. D = of 
19. Ifall partial derivatives f/x1,..., Əf/ðxn ofa func- »DIGYD= ay 
tion f(x1,...,X,) exist ata = (a1, .. . , an), then f is af 
differentiable at a. 28. Dif(x,y, d= a 
z 


20. Iff: R — R and g: R* — R> are both differentiable 


= gi i i in R2 
ata € R4, then D(f — g)(a) = Df(a) — Dg(a). 29. If f(x, y) = sinx cos y and v is a unit vector in R4, 


m T V2 
21. Theres a function f of class C? such that then 0 < Dy f (5. =) aa 
ð 
`f = y? — 2x and < = y—3xy. 30. If v is a unit vector in R? and f(x, y, z) = sinx — 
cos y + sin z, then 
22. If the second-order partial derivatives of f exist at 
(a, b), then Tola; b) = Ayx(a, b). —V3 < Dy f(x, y,z) < J/3. 


23. If w = F(x, y, z) and z = g(x, y) where F and g are 
differentiable, then 


ðw aF OF ag 
ax Ox Oz ax 
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1. Let f(x) = (i+ k) x x. order. Complete the following table by matching each 
function in the table with it h lot of its level 
(a) Write the component functions of f. pena inne tabe with its grapi and plot cla leye 
(b) Describe the domain and range of f. 
Graph Level curves 
2. Let f(x) = projzi-2j+k¥, Where x = xi + yj + zk. Function (uppercase | (lowercase 
(a) Describe the domain and range of f. F(x, y) letter) letter) 
(b) Write the component functions of f. 
fy) = x? +y? +1 
f(x, y) = sin yx? + y? 
3. Let SQ, y) = /Xy. f(x, y) = By? ne 2x2)e -2 
(a) Find the domain and range of f. fœ y) =y — 3x°y , 
(b) Is the domain of f open or closed? Why? f(x, y) = xy *e oo : 
fœ, y) = ye" 
X eee aal —ŮŮŮ 
4. Let g(x, y) = . 6. Consider the function f(x, y) = 2 + In(x? + y3. 
(a) Determine the domain and range of g. (a) pene a ee a fae aan 
(b) Is the domain of g open or closed? Why? give a few moe 
5. Figure 2.77 shows the graphs of six functions f(x, y) (b) Using part (a) or otherwise, give a rough sketch of 


and plots ofthe collections oftheir level curves in some the graph of z = f(x, y). 
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Figure 2.77 Figures for Exercise 5. 


7. Use polar coordinates to evaluate 9. Let 
2 3 2 
im g a Pee aoe ) ite y #0. 0) 
8. This problem concerns the function 0 if (x, y) = (0, 0) 
2xy ; Show that the function g(x) = F(x, 0) is continuous at 
fa.yy= ery? nee AO) x = 0. Show that the function h(y) = F(0, y) is con- 


tinuous at y = 0. However, show that F fails to be 
continuous at (0, 0). (Thus, continuity in each variable 
separately does not necessarily imply continuity of the 
function.) 


0 if (x, y) = (0, 0) 


(a) Use polar coordinates to describe this function. 


(b) Using the polar coordinate description obtained in 
part (a), give some level curves for this function. 10 


Suppose f:U C R” — R is not defined at a point 
a € R” but is defined for all x near a. In other words, 
the domain U of f includes, for some r > 0, the set 
B, = {x € R” | 0 < ||x —all < r}. (The set B, is just 
(e) Is f continuous? Why or why not? an open ball of radius r centered at a with the point 


(c) Prepare a rough sketch of the graph of f. 
(d) Determine lim(,,).(0,0) f(x, y), if it exists. 


a deleted.) Then we say limy-,, f(x) = +œ if f(x) 

grows without bound as x — a. More precisely, this 

means that given any N > 0 (no matter how large), 

there is some ô > 0 such that if 0 < ||x — al] < ô (e., 

ifx € B,), then f(a) > N. 

(a) Using intuitive arguments or the preceding tech- 
nical definition, explain why lim,_,9 1 /x? =. 


(b) Explain why 


2 
lim = 
y0. (x — 1)? + (y — 3)? 


(c) Formulate a definition of what it means to say that 
lim f(x) = —oo. 
x>a 


(d) Explain why 


l-—x 


lim = 
(x,y) (0,0) xy — y4 +x — x2 


Exercises ] 1—17 involve the notion of windchill temperature— 
see Example 7 in §2.1, and refer to the table of windchill values 
on page 85. 


11. 


12. 


13. 


14. 


15. 


a) Find the windchill temperature when the air tem- 
p 
perature is 25°F and the windspeed is 10 mph. 


(b) If the windspeed is 20 mph, what air temperature 
causes a windchill temperature of — 15°F? 


(a) Ifthe air temperature is 10°F, estimate (to the near- 
est unit) what windspeed would give a windchill 
temperature of —5°F. 

(b) Do you think your estimate in part (a) is high or 
low? Why? 


Ata windspeed of 30 mph and air temperature of 35°F, 
estimate the rate of change of the windchill tempera- 
ture with respect to air temperature if the windspeed is 
held constant. 


Ata windspeed of 15 mph and air temperature of 25°F, 
estimate the rate of change of the windchill tempera- 
ture with respect to windspeed. 


Windchill tables are constructed from empirically de- 
rived formulas for heat loss from an exposed sur- 
face. Early experimental work of P. A. Siple and C. F. 
Passel,* resulted in the following formula: 


W = 91.4 + (t — 91.4)(0.474 + 0.304/5 — 0.02035). 


16. 


17. 
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Here W denotes windchill temperature (in degrees 

Fahrenheit), ¢ the air temperature (fort < 91.4°F), and 

s the windspeed in miles per hour (for s > 4 mph). 

(a) Compare your answers in Exercises 11 and 12 with 
those computed directly from the Siple formula 
just mentioned. 


(b) Discuss any differences you observe between your 
answers to Exercises 11 and 12 and your answers 
to part (a). 

(c) Why is it necessary to take ¢ < 91.4°F and 
s > 4 mph in the Siple formula? (Don’t look for 
a purely mathematical reason; think about the 
model.) 


Recent research led the United States National Weather 
Service to employ a new formula for calculating wind- 
chill values beginning November 1, 2001. In partic- 
ular, the table on page 85 was constructed from the 
formula 


W = 35.74 + 0.621t — 35.755°!® + 0.4275r5° 6. 


Here, as in the Siple formula of Exercise 15, W de- 
notes windchill temperature (in degrees Fahrenheit), 
t the air temperature (for t < 50°F), and s the wind- 
speed in miles per hour (for s > 3 mph).° Compare 
your answers in Exercises 13 and 14 with those com- 
puted directly from the National Weather Service 
formula above. 


In this problem you will compare graphically the two 

windchill formulas given in Exercises 15 and 16. 

(a) If Wi(s, t) denotes the windchill function given by 
the Siple formula in Exercise 15 and W2(s, t) the 
windchill function given by the National Weather 
Service formula in Exercise 16, graph the curves 
y = W,(s, 40) and y = W2(s, 40) on the same set 
of axes. (Let s vary between 3 and 120 mph.) In 
addition, graph other pairs of curves y = Wj(s, fo), 
y = W,(s, to) for other values of tọ. Discuss what 
your results tell you about the two windchill 
formulas. 

(b) Now graph pairs of curves y = W\(so,t), y = 
W2(so, t) for various constant values so for wind- 
speed. Discuss your results. 


(c) Finally, graph the surfaces z= W\(s,t) and 
z = W2(s, t) and comment. 


4 “Measurements of dry atmospheric cooling in subfreezing temperatures,” Proc. Amer. Phil. Soc., 89 


(1945), 177-199. 


> From Bob Rilling, Atmospheric Technology Division, National Center for Atmospheric Research 
(NCAR), “Calculating Windchill Values,” February 12, 1996. Found online at http://www.atd.ucar.edu/ 
homes/rilling/we_formula.html (July 31, 2010). 

6 NOAA, National Weather Service, Office of Climate, Water, and Weather Services, “NWS Wind Chill 
Temperature Index.” February 26, 2004. <http://www.nws.noaa.gov/om/ windchill> (July 31, 2010). 
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18 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
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. Consider the sphere of radius 3 centered at the origin. 
The plane tangent to the sphere at (1, 2, 2) intersects 
the x-axis at a point P. Find the coordinates of P. 


Show that the plane tangent to a sphere at a point P on 
the sphere is always perpendicular to the vector OP 
from the center O of the sphere to P. (Hint: Locate the 
sphere so its center is at the origin in R°.) 


The surface z= 3x? + 4x3 — 3x4 — 4y? is inter- 
sected by the plane 2x — y = 1. The resulting intersec- 
tion is a curve on the surface. Find a set of parametric 
ciano: for the line tangent to this curve at the point 
(1, 1, — 5). 


Consider the cone z? = x? + y’. 

(a) Find an equation of the plane tangent to the cone 
at the point (3, —4, 5). 

(b) Find an equation of the plane tangent to the cone 
at the point (a, b, c). 

(c) Show that every tangent plane to the cone must 
pass through the origin. 


Show that the two surfaces 


So Zz = yenye 


and i 


Sj: Z=xy 
intersect perpendicularly at the point (2, 1, 2). 


Consider the surface z = x? + 4y?. 


(a) Find an equation for the plane that is tangent to the 
surface at the point (1, —1, 5). 


(b) Now suppose that the surface is intersected with the 
plane x = 1. The resulting intersection is a curve 
on the surface (and is a curve in the plane x = 1 
as well). Give a set of parametric equations for the 
line in R? that is tangent to this curve at the point 
(1, —1, 5). A rough sketch may help your thinking. 


A turtleneck sweater has been washed and is now tum- 
bling in the dryer, along with the rest of the laundry. At 
a particular moment fo, the neck of the sweater mea- 
sures 18 inches in circumference and 3 inches in length. 
However, the sweater is 100% cotton, so that at fo the 
heat of the dryer is causing the neck circumference to 
shrink at a rate of 0.2 in/min, while the twisting and 
tumbling action is causing the length of the neck to 
stretch at the rate of 0.1 in/min. How is the volume V 
of the space inside the neck changing at t = tọ? Is V 
increasing or decreasing at that moment? 


A factory generates air pollution each day according 
to the formula 


P(S, T) = 3308°3 745, 


where S denotes the number of machine stations in 
operation and T denotes the average daily tempera- 
ture. At the moment, 75 stations are in regular use and 
the average daily temperature is 15°C. If the average 


26. 


27. 


28. 


29. 


30. 


temperature is rising at the rate of 0.2°C/day and the 
number of stations being used is falling at a rate of 
2 per month, at what rate is the amount of pollution 
changing? (Note: Assume that there are 24 workdays 
per month.) 


Economists attempt to quantify how useful or satisfy- 
ing people find goods or services by means of utility 
functions. Suppose that the utility a particular individ- 
ual derives from consuming x ounces of soda per week 
and watching y minutes of television per week is 
u(x, y)=1— 7 0.001x7—0.00005y?_ 

Further suppose that she currently drinks 80 oz of soda 
per week and watches 240 min of TV each week. If she 
were to increase her soda consumption by 5 oz/week 
and cut back on her TV viewing by 15 min/week, is 
the utility she derives from these changes increasing 
or decreasing? At what rate? 


Suppose that w = x? + y? + z? and x = pcos@ sing, 
y = psing sing, z = p cos Y. (Note that the equations 
for x, y, and z in terms of p, p, and @ are just the con- 
version relations from spherical to rectangular coordi- 
nates.) 

(a) Use the chain rule to compute dw/dp, dw/d9, 
and dw/00. Simplify your answers as much as 
possible. 

(b) Substitute p, g, and 0 for x, y, and z in the original 
expression for w. Can you explain your answer in 
part (a)? 


Ifw=f (* +2), show that 
xy 
20w 23w a5 
ox dy f 


(You should assume that f is a differentiable function 
of one variable.) 


Let z = g(x, y) be a function of class C*, and let 
x = e" cosé, y = e" sind. 
(a) Use the chain rule to find 9z/ðr and 0z/06 in terms 


of dz/dx and 0z/dy. Use your results to solve for 
dz/dx and dz/dy in terms of dz/dr and 0z/00. 


(b) Use part (a) and the product rule to show that 


a7z $ 3z [8z é a7z 
=e z 
ax? əy? 3r? ag? 


(a) Use the function f(x, y) = x” (= e*) and the 


multivariable chain rule to calculate T (u), 
u 
(b) Use the multivariable chain rule to calculate 


d A cost 
g” ). 


31. 


32. 


33. 


34. 


35. 


Use the function f(x, y, z) = x” and the multivariable 


chain rule to calculate an (u*). 
u 


Suppose that f: R” — R is a function of class C?. The 
Laplacian of f, denoted V? f, is defined to be 


a f a f af 
Wf=—t => 
f ax? ax? ax2 


When n = 2 or 3, this construction is important when 
studying certain differential equations that model phys- 
ical phenomena, such as the heat or wave equations. 
(See Exercises 28 and 29 of $2.4.) Now suppose that 
f depends only on the distance x = (x1,...,X,) iS 
from the origin in R”; that is, suppose that f(x) = g(r) 
for some function g, where r = ||x||. Show that for all 
x Æ 0, the Laplacian is given by 


2 n-l 1 n 
V I>", oC rrs (r). 


(a) Consider a function f(x, y) of class C*. Show 
that if we apply the Laplacian operator V? = 
Ə? /dx? + 8*/dy? twice to f, we obtain 


at f at f at f 
VV? f) = : 
wp) ax ax dy? i ay* 


(b) Now suppose that f is a function of n variables of 
class C*. Show that 


weve SL 
ema ee 
i= Ley 
Functions that satisfy the partial differential equa- 
tion V?(V? f) = 0 are called biharmonic func- 
tions and arise in the theoretical study of elasticity. 


Livinia, the housefly, finds herself caught in the oven 
at the point (0, 0, 1). The temperature at points in the 
oven is given by the function 


T(x, yz) = 10(xe~”” + ze"), 

where the units are in degrees Celsius. 

(a) If Livinia begins to move toward the point (2, 3, 1), 
at what rate (in deg/cm) does she find the temper- 
ature changing? 

(b) In what direction should she move in order to cool 
off as rapidly as possible? 

(c) Suppose that Livinia can fly at a speed of 3 cm/sec. 
If she moves in the direction of part (b), at what 
(instantaneous) rate (in deg/sec) will she find the 
temperature to be changing? 


Consider the surface given in cylindrical coordinates 

by the equation z = r cos 30. 

(a) Describe this surface in Cartesian coordinates, that 
is, as z= f(x, y). 


36. 
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(b) Is f continuous at the origin? (Hint: Think cylin- 
drical.) 

(c) Find expressions for df/dx and df/dy at points 
other than (0, 0). Give values for 0f/0x and 3f /ðy 
at (0, 0) by looking at the partial functions of f 
through (x, 0) and (0, y) and taking one-variable 
limits. 

(d) Show that the directional derivative Da f (0, 0) ex- 
ists for every direction (unit vector) u. (Hint: Think 
in cylindrical coordinates again and note that you 
can specify a direction through the origin in the 
xy-plane by choosing a particular constant value 
for 0.) 


(e) Show directly (by examining the expression for 
df/dy when (x, y) (0, 0) and also using part (c)) 
that 0f/dy is not continuous at (0, 0). 


(f) Sketch the graph of the surface, perhaps using a 
computer to do so. 


The partial differential equation 
u u u au 
ca 
əx? ay? az? at? 


is known as the wave equation. It models the motion 
ofa wave u(x, y, z, t)in R? and was originally derived 
by Johann Bernoulli in 1727. In this equation, c is a 
positive constant, the variables x, y, and z represent 
spatial coordinates, and the variable t represents time. 
(a) Let u = cos(x — t) + sin(x + t) — 2e*** — (y — 
t}. Show that u satisfies the wave equation with 
e= 1, 
(b) More generally, show that if fi, f2, 81, g2, 41, and 
hz are any twice differentiable functions of a single 
variable, then 


u(x, y, z,¢) = fix —t)t+ fox +t) 
+ gi(y —t)+ g2(y +1) 
+hy(z—t)+ho(z+t) 


satisfies the wave equation with c = 1. 


Let X be an open set in R”. A function F: X — R is said to be 


homogeneous of degree d if, for all x = (x1, x2, . . . , Xn) E€ X 
and all t € R such that tx € X, we have 
F(tx,,txo,...,tX%,) = t! F(x, KF, shay Bai 


Exercises 37-44 concern homogeneous functions. 


In Exercises 37—41, which of the given functions are homoge- 
neous? For those that are, indicate the degree d of homogeneity. 


37. 
38. 
39. 


F(x,y)= x3 + xy? — 6y? 
F(x, y, 2) = xy — x?z? + z’ 
F(x, y, z) = zy? — x? 4x72 
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40. 


41. 


42. 


43. 
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F(x, y) = 2” 
Hy y2 


PUR, y, z) = xyz + 7xz2 


If F(x, y,z) is a polynomial, characterize what it 
means to say that F is homogeneous of degree d (i.e., 
explain what must be true about the polynomial if it is 
to be homogeneous of degree d). 


Suppose F(x), X2, ..., Xn) is differentiable and homo- 
geneous of degree d. Prove Euler’s formula: 


44. 


(Hint: Take the equation F(tx,, tx2,...,tX,) = 
tF (x1, X2,..-, Xn) that defines homogeneity and dif- 
ferentiate with respect to t.) 


Generalize Euler’s formula as follows: If F is of class 
C? and homogeneous of degree d, then 


n 2 


32F 
P = d(d — 1)F. 
DEIF i ek 


i,j=l shee 


Can you conjecture what an analogous formula 
involving the kth-order partial derivatives should look 
like? 
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x 


Figure 3.1 The path x of 
Example 1. 


Vector- Valued 
Functions 


Introduction 


The primary focus of Chapter 2 was on scalar-valued functions, although general 
mappings from R” to R” were considered occasionally. This chapter concerns 
vector-valued functions of two special types: 


1. Continuous mappings of one variable (i.e., functions x: Z C R —> R”, where 
I is an interval, called paths in R”). 
2. Mappings from (subsets of) R” to itself (called vector fields). 


An understanding of both concepts is required later, when we discuss line and 
surface integrals. 


3.1 Parametrized Curves and Kepler’s Laws 


Paths in R” 
We begin with a simple definition. Let Z denote any interval in R. (So J can be 
of the form [a, b], (a, b), [a, b), (a, b], [a, 00), (a, œœ), (—20, b], (—00, b), or 
(—co, œ) = R.) 


DEFINITION 1.1 A path in R” is a continuous function x: J > R”. If I = 
[a, b] for some numbers a < b, then the points x(a) and x(b) are called the 
endpoints of the path x. (Similar definitions apply if J = [a, b), [a, oo), etc.) 


EXAMPLE 1 Let a and b be vectors in R? with a Æ 0. Then the function 
x: (—00, 00) > R? given by 


x(t)=b-+ta 
defines the path along the straight line parallel to a and passing through the end- 
point of the position vector of b as in Figure 3.1. (See formula (1) of 81.2.) + 
EXAMPLE 2 The pathy: [0, 277) > R? given by 
y(t) = (3 cost, 3 sint) 


can be thought of as the path of a particle that travels once, counterclockwise, 
around a circle of radius 3 (Figure 3.2). + 
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Figure 3.4 The path x and its 
velocity vector v. 


Figure 3.2 The path y of Figure 3.3 The path z of 
Example 2. Example 3. 


EXAMPLE 3 The map z: R > R? defined by 
z(t) = (acost,asint, bt), a,b constants (a > 0) 


is called a circular helix, so named because its projection in the xy-plane is a 
circle of radius a. The helix itself lies in the right circular cylinder x? + y? = a? 
(Figure 3.3). The value of b determines how tightly the helix twists. + 


We distinguish between a path x and its range or image set x(J/), the latter 
being a curve in R”. By definition, a path is a function, a dynamic object (at least 
when we imagine the independent variable ¢ to represent time), whereas a curve 
is a Static figure in space. With such a point of view, it is natural for us to consider 
the derivative Dx(t), which we also write as x’(t) or v(t), to be the velocity vector 
of the path. We can readily justify such terminology. Since 


x(t) = (x1 (f), x2(t), ..., Xn(t)) 
is a function of just one variable, 
x(t + At) — x(t) 


WO =O) jn N 


Thus, v(t) is the instantaneous rate of change of position x(t) with respect to t 
(time), so it can appropriately be called velocity. Figure 3.4 provides an indication 
as to why we draw v(t) as a vector tangent to the path at x(t). Continuing in this 
vein, we introduce the following terminology: 


DEFINITION 1.2 Letx: J —> R” bea differentiable path. Then the velocity 
v(t) = x'(t) exists, and we define the speed of x to be the magnitude of 
velocity; that is, 


Speed = ||v(‘)|). 


If v is itself differentiable, then we call v(t) = x” (t) the acceleration of x 
and denote it by a(t). 


EXAMPLE 4 The helix x(t) = (a cost, a sint, bt) has 


v(t) = —asinti+acostj+bk and a(t)=—acosti—asintj. 


(A 


x(0) 


x 


Figure 3.5 The path of the line 
tangent to x(t) at the point xo. 
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Thus, the acceleration vector is parallel to the x y-plane (i.e., is horizontal). The 
speed of this helical path is 


IIv(t)|| = asint)? + (a cost) +b? = Va? + b, 


which is constant. + 


The velocity vector v is important for another reason, namely, for finding equations 
of tangent lines to paths. The tangent line to a differentiable path x, at the point 
Xo = X(f), is the line through xo that is parallel to any (nonzero) tangent vector 
to x at xg. Since v(t), when nonzero, is always tangent to x(t), we may use equa- 
tion (1) of §1.2 to obtain the following vector parametric equation for the tangent 
line: 


I(s) = Xo + SVo. (1) 


Here vo = v(t) and s may be any real number. 

In equation (1), we have 1(0) = xo. To relate the new parameter s to the 
original parameter t for the path, we set s = t — to and establish the following 
result: 


PROPOSITION 1.3 Let x be a differentiable path and assume that vo = 
v(to) Æ 0. Then a vector parametric equation for the line tangent to x at xq = x(fo) 
is either 

I(s) = Xo + svo (2) 


or 


I(t) = Xo + (t — to)Vo. (3) 
(See Figure 3.5.) 


EXAMPLE 5 If x(t) = (3t + 2, t? — 7, t — t°), we find parametric equations 
for the line tangent to x at (5, —6, 0) = x(1). 
For this path, v(t) = x’(t) = 3i + 2tj + (1 — 2r)k, so that 


vo = V(1) = 3i + 2j — k. 
Thus, by formula (3), 
\(t) = (5i — 6j) + ¢ — 1)Gi + 2j — k). 


Taking components, we read off the parametric equations for the coordinates 
of the tangent line as 


x=3t+2 
y=2t-—8. 
z=l1-t + 


The physical significance of the tangent line is this: Suppose a particle of 
mass mm travels along a path x. If, suddenly, at t = tọ, all forces cease to act on the 
particle (so that, by Newton’s second law of motion F = ma, we have a(t) = 0 
for t > to), then the particle will follow the tangent line path of equation (3). 
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EXAMPLE 6 If Roger Ramjet is fired from a cannon, then we can use vectors 
to describe his trajectory. (See Figure 3.6.) 


y 


Roger’s path 


Figure 3.6 Roger Ramjet’s path. 


We’ll assume that Roger is given an initial velocity vector vo by virtue of the 
firing of the cannon and that thereafter the only force acting on Roger is due to 
gravity (so, in particular, we neglect any air resistance). Let us choose coordinates 
so that Roger is initially at the origin, and throughout our calculations we’ll neglect 
the height of the cannon. Let x(t) = (x(t), y(t)) denote Roger’s path. Then the 
information we have is 


a(t) =x'"(t) = —gj 
(i.e., the acceleration due to gravity is constant and points downward); hence, 


v(0) = x'(0) = vo 
and 


x(0) = 0. 


Since a(t) = w(t), we simply integrate the expression for acceleration compo- 
nentwise to find the velocity: 


vin= f adr = f -eidi = -gri +e. 


Here ¢c is an arbitrary constant vector (the “constant of integration”). Since v(0) = 
vo, we must have ¢ = Vo, so that 

v(t) = —gtjt vo. 
Integrating again to find the path, 


1 
O= f vodi= f git vdi = Jer itive +d, 


where d is another arbitrary constant vector. From the remaining fact that x(0) = 0, 
we conclude that 


1 
x(t) = =] + tvo (4) 


describes Roger’s path. 
To understand equation (4) better, we write Vo in terms of its components: 
Vo = vo cos 0 i + vo sind j. 


Here vo = ||Vo|| is the initial speed. (We’re really doing nothing more than 
expressing the rectangular components of vo in terms of polar coordinates. 


Figure 3.7 Roger’s initial 
velocity. 


Suz 


Figure 3.8 An epicycle. 


Epicycle 
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See Figure 3.7.) Thus, 
x(t) = —5.gt?j + t(vo cos i+ vo sind j) 


1 
= (vo cos 0)t i + (o sin 0)t — ze) j. 


From this, we may read off the parametric equations: 


x = (vo cos 0 )t 
Fi 1 i 
y = (vo sind )t — 58 
from which it is not difficult to check that Roger’s path traces a parabola. + 


Here are two practical questions concerning the set-up of Example 6: First, for 
a given initial velocity, how far does Roger travel horizontally? Second, for a given 
initial speed, how should the cannon be aimed so that Roger travels (horizontally) 
as far as possible? To find the range of the cannon shot and thereby answer the 
first question, we need to know when y = 0 (i.e., when Roger hits the ground). 
Thus, we solve 


(vp sin @)t — igt” = t(vosinð — tgt) = 0 


for t. Hence, y = 0 when t = 0 (which is when Roger blasts off) and when 
t = (2vo sin 0)/g. At this later time, 


2vo sin 0 v2 sin20 
x = (00050)  ( 9 J- 2 : (5) 
& & 
Formula (5) is Roger’s horizontal range for a given initial velocity. To maximize 
the range for a given initial speed vg, we must choose @ so that (v4 sin 20)/g is 
as large as possible. Clearly, this happens when sin 20 = 1 (i.e., when 0 = 77/4). 


Kepler’s Laws of Planetary Motion (optional) 


Since classical antiquity, individuals have sought to understand the motions of 
the planets and stars. The majority of the ancient astronomers, using a combina- 
tion of crude observation and faith, believed all heavenly bodies revolved around 
the earth. Fortunately, the heliocentric (or “sun-centered”) theory of Nicholas 
Copernicus (1473-1543) did eventually gain favor as observational techniques 
improved. However, it was still believed that the planets traveled in circular or- 
bits around the sun. This circular orbit theory did not correctly predict planetary 
positions, so astronomers postulated the existence of epicycles, smaller circular 
orbits traveling along the major circular arc, an example of which is shown in 
Figure 3.8. Although positional calculations with epicycles yielded results closer 
to the observed data, they still were not correct. Attempts at further improvements 
were made using second- and third-order epicycles, but any gains in predictive 
power were made at a cost of considerable calculational complexity. A new idea 
was needed. Such inspiration came from Johannes Kepler (1571—1630), son of a 
saloonkeeper and assistant to the Danish astronomer Tycho Brahe. The classical 
astronomers were “stuck on circles” for they believed the circle to be a perfect 
form and that God would use only such perfect figures for planetary motion. 
Kepler, however, considered the other conic sections to be as elegant as the cir- 
cle and so hypothesized the simple theory that planetary orbits are elliptical. 
Empirical evidence bore out this theory. 
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Figure 3.9 Kepler’s second law 
of planetary motion: If 

h — ti = t4 — h, then Ay = Ad, 
where A, and A> are the areas of 
the shaded regions. 


Kepler’s three laws of planetary motion are 


1. The orbit of a planet is elliptical, with the sun at a focus of the ellipse. 

2. During equal periods of time, a planet sweeps through equal areas with respect 
to the sun. (See Figure 3.9.) 

3. The square of the period of one elliptical orbit is proportional to the cube of 
the length of the semimajor axis of the ellipse. 


Kepler’s laws changed the face of astronomy. We emphasize, however, that 
they were discovered empirically, not analytically derived from general physical 
laws. The first analytic derivation is frequently credited to Newton, who claimed 
to have established Kepler’s laws (at least the first and third laws) in Book I of 
his Philosophiae Naturalis Principia Mathematica (1687). However, a number of 
scientists and historians of science now consider Newton’s proof of Kepler’s first 
law to be flawed and that Johann Bernoulli (1667—1748) offered the first rigorous 
derivation in 1710.! In the discussion that follows, Newton’s law of universal 
gravitation is used to prove all three of Kepler’s laws. 

In our work below, we assume that the only physical effects are those be- 
tween the sun and a single planet—the so-called two-body problem. (The n-body 
problem, where n > 3 is, by contrast, an important area of current mathematical 
research.) To set the stage for our calculations, we take the sun to be fixed at the 
origin O in R? and the planet to be at the moving position P. We also need the 
following two “vector product rules,” whose proofs we leave to you: 


PROPOSITION 1.4 
1. If x and y are differentiable paths in R”, then 


d dx dy 
Pa a ee 
2. If x and y are differentiable paths in R3, then 
d dx dy 
a= Pr xy+xx a 


First, we establish the following preliminary result: 


PROPOSITION 1.5 The motion of the planet is planar, and the sun lies in the 
planet’s plane of motion. 


PROOF Letr = OP. Then r is a vector whose representative arrow has its tail 
fixed at O. (Note that r = r(t); that is, r is a function of time.) If v = r’(t), we 
will show that r x v is a constant vector c. This result, in turn, implies that r must 
always be perpendicular to ¢ and, hence, that r always lies in a plane with ¢ as 
normal vector. 

To show that r x v is constant, we show that its derivative is zero. By part 2 
of Proposition 1.4, 


( ) dr es dv i 
—(r A V] S — x V r xk — =VXx V rxa, 
dt dt dt 


l For an indication of the more recent controversy surrounding Newton’s mathematical accomplishments, 
see R. Weinstock, “Isaac Newton: Credit where credit won’t do,” The College Mathematics Journal, 25 
(1994), no. 3, 179-192, and C. Wilson, ““Newton’s orbit problem: A historian’s response,” Ibid., 193—200, 
and related papers. 
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by the definitions of velocity and acceleration. We know that v x v = 0 (why?), so 
d 
—(r = a. 6 
a (rx v)=rx (6) 


Now we use Newton’s laws. Newton’s law of gravitation tells us that the planet 
is attracted to the sun with a force 

GMm 
2 


F=-—- u, (7) 


7 
where G is Newton’s gravitational constant (= 6.6720 x 107!! Nm?/kg’), M 
is the mass of the sun, m is the mass of the planet (in kilograms), r = ||r||, and 
u = r/||r|| (distances in meters). On the other hand, Newton’s second law of 
motion states that, for the planet, 


F=ma 
Thus, 
GMm 
ma = -= 7 u, 
F 
or 
GM 
r 


Therefore, a is just a scalar multiple of r and hence is always parallel to r. In 
view of equations (6) and (8), we conclude that 


d 
qo arxam 


(i.e., that r x v is constant). E 


THEOREM 1.6 (KEPLER’S FIRST LAW) In a two-body system consisting of one 
sun and one planet, the planet’s orbit is an ellipse and the sun lies at one focus of 
that ellipse. 


PROOF We will eventually find a polar equation for the planet’s orbit and see 
that this equation defines an ellipse as described. We retain the notation from 
the proof of Proposition 1.5 and take coordinates for R? so that the sun is at the 
origin, and the path of the planet lies in the xy-plane. Then the constant vector 
c =r x v used in the proof of Proposition 1.5 may be written as ck, where c is 
some nonzero real number. This set-up is shown in Figure 3.10. 


z 


x 


u (unit length) 


Figure 3.10 Establishing Kepler’s laws. 


196 


Chapter 3 | Vector-Valued Functions 


Step 1. We find another expression for e. By definition of u in formula (7), 
r = ru, so that, by the product rule, 


d du dr 
van ae ae 
Hence, 
c=rxv=(ru)x (+ Zu) =r? (F) eFax. 
dt dt dt dt 


Since u x u must be zero, we conclude that 


e= (ux T): (9) 


Step 2. We derive the polar equation for the orbit. Before doing so, however, 
note the following result, whose proof is left to you as an exercise: 


PROPOSITION 1.7 If x(t) has constant length (i.e., ||x(¢)|| is constant for 
all t), then x is perpendicular to its derivative dx/dt. 


Continuing now with the main argument, note that the vector r(t) is defined 
so that its magnitude is precisely the polar coordinate r of the planet’s position. 
Using equations (8) and (9), we find that 


ance GM 2 du 
xe= | -— xr tux — 
r2 i dt 


ll II 
g l 
a | = 
Po r 
= = 
x x 
a 
Sle <= 
Wee” x 
x a 
= è 
| a | Sees” 
| a | 


d d 
=GM jo . D — (« . =) u (see Exercise 27 of §1.4) 
du we 
=GM kir — Ou (by Proposition 1.7) 
d 
= — (GMu), 
uaa 


since G and M are constant. On the other hand, we can “reverse” the product rule 
to find that 


aren we 
dt 
dv a de ie i wan 
= — — in is constan 
o YS since ¢ is consta: 
d 
= —(vVxc). 


dt 


x 


Figure 3.11 The angle 6 is 
the angle between r and d. 
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Thus, 
axc d GM ) í c) 
x c= — = —(vx 
dt s u” 


and, hence, 
vxc= GMu+d, (10) 


where d is an arbitrary constant vector. Because both v x ¢ and u lie in the xy- 
plane, so must d. 

Let us adjust coordinates, if necessary, so that d points in the i-direction (i.e., 
so that d = di for some d € R). This can be accomplished by rotating the whole 
set-up about the z-axis, which does not lift anything lying in the xy-plane out of 
that plane. Then the angle between r (and hence u) and d is the polar angle 0 as 
shown in Figure 3.11. 

By Theorem 3.3 of Chapter 1, 


u-d = |jul| ||d|| cos = d cos 8. (11) 
Since c = |lell, 
c=e-e 
=(rxv)-e 
=r-(vxc) (Why? See formula (4) of §1.4.) 


=ru-(GMu-+ d) by equation (10). 
Hence, 
c = GMr +rdcosé 
by equation (11). We can readily solve this equation for r to obtain 


c2 


r= — , 
GM +dcosé 
the polar equation for the planet’s orbit. 


(12) 


Step 3. We now check that equation (12) really does define an ellipse by 
converting to Cartesian coordinates. First, we’ll rewrite the equation as 
g (c?/GM) 


"GM +dcosd 1+(d/GM)cos0’ 


and then let p = c?/GM,e = d/GM for convenience. (Note that p > 0.) Hence, 
equation (12) becomes 


j=_—_ (13) 
1+ecosé 
A little algebra provides the equivalent equation, 
r = p-—ercosé. (14) 


Now r cos 0 = x (x being the usual Cartesian coordinate), so that equation (14) 
is equivalent to 


r=p-—ex. 


To complete the conversion, we square both sides and find, by virtue of the fact 
that r? = x? + y’, 


x + y? = p — 2pex + ex. 


198 


Chapter 3 | Vector-Valued Functions 


A little more algebra reveals that 


(1 — e?)x? + 2pex +y = p. (15) 


Therefore, the curve described by the preceding equation is an ellipse if 0 < 
le| < 1, a parabola ife = +1, and a hyperbola if |e| > 1. Analytically, there is no 
way to eliminate the last two possibilities. Indeed, “uncaptured” objects such as 
comets or expendable deep space probes can have hyperbolic or parabolic orbits. 
However, to have a closed orbit (so that the planet repeats its transit across the 
sky), we are forced to conclude that the orbit must be elliptical. 

More can be said about the elliptical orbit. Dividing equation (15) by 1 — e? 
and completing the square in x, we have 


2 
PON ay, We ae 
l=] 1-82 (=e 


This is equivalent to the rather awkward-looking equation 


(tpe -A 
Pee “p= 
From equation (16), we see that the ellipse is centered at the point (— pe/(1 — e°), 


0), that its semimajor axis has length a = p/(1 — e°), and that its semiminor axis 
has length b = p/./1 — e?. The foci of the ellipse are at a distance 


=1. (16) 


2 


2 
Lz 4a Pp Pp plel 
eRe joi E 


l-e 1-2 


from the center. (See Figure 3.12.) Hence, we see that one focus must be at the 
origin, the location of the sun. Our proof is, therefore, complete. a 


Fortunately, all the toil involved in proving the first law will pay off in proofs 
of the second and third laws, which are considerably shorter. Again, we retain all 
the notation we already introduced. 


THEOREM 1.8 (KEPLER’S SECOND LAW) During equal intervals of time, a planet 
sweeps through equal areas with respect to the sun. 


Semiminor 
axis 


Semimajor axis 
A 


@ a 3 x 
Focus (-pe/(1-e?),0) Focus 


Figure 3.12 The ellipse of equation (16). 
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Po (ro, 90) 


P(r, 0) 


Figure 3.13 The shaded area A(@) is 
given by S ir’ dọ. 


PROOF Fix one point Po on the planet’s orbit. Then the area A swept between 
Po and a second (moving) point P on the orbit is given by the polar area integral 


8 1 A 
=f `r? dọ. 
w 2 


(See Figure 3.13.) Thus, we may reformulate Kepler’s law to say that dA/dt is 
constant. We establish this reformulation by relating d A /dt to a known constant, 
namely, the vector € = r x v. 

By the chain rule (in one variable), 


dA _ dA d9 
dt dð dt 
By the fundamental theorem of calculus, 


Z f irdp= rO 


dé d0 Jo, 2 
Hence, 
dA 1 , dé 
eae ee eee 17 
dt 2 dt a) 


Now, we relate c to d0 /dt by means of equation (9). Therefore, we compute 
u x du/dt in terms of 6. Recall that u = — r and r = r cos 0 i + r sin j. Thus, 
u = cosĝ i + sin} 
du dé do 


= — sind —i+cosd —j. 
i sin T + COS oe 


Hence, it follows by direct calculation of the cross product that 


dA 1 
ee, (18) 


a constant. | 
THEOREM 1.9 (KEPLER’S THIRD LAW) IfT is the length of time for one plane- 


tary orbit, and a is the length of the semimajor axis of this orbit, then T? = Ka?’ 
for some constant K. 
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3.1 Exercises 


PROOF We focus on the total area enclosed by the elliptical orbit. The area of an 
ellipse whose semimajor and semiminor axes have lengths a and b, respectively, 
is xab. This area must also be that swept by the planet in the time interval [0, T]. 
Thus, we have 


dt 
ri 
= godt by equation (18) 
0 
= 
Hence, 
2nab 4r?a?b? 
=Z? 90 Pa. (19) 
c c 


Now, b and c are related to a, so these quantities must be replaced before we are 
done. In particular, from equation (16), b? = p?/(1 — e?), so 


b = pa. 
Also 7 2 
P= GM 


(See equations (12) and (13).) With these substitutions, the result in (19) becomes 


_ 4ra (pa) _ 4r? m3 
pGM GM 


T2 


This last equation shows that T? is proportional to a*, but it says even more: 

The constant of proportionality 4277/GM depends entirely on the mass of the 

sun—the constant is the same for any planet that might revolve around the sun. 
| 


In Exercises 1—6, sketch the images of the following paths, us- 8. x(t) = Scosti+3sintj 


ing arrows to indicate the direction in which the parameter 


increases: 

x=2t-1 

E a : 

2. x(t) =eite‘'j 

3. eee 
y=tsint 
x = 3cost 

a: = 


5. x(t) = (t, 307 + 1,0) 


6. x(t) = (t, #7, ©) 


—6n <t < 67 


9. x(t) = (t sint, t cost, t°) 
10. x(t) = (e', e”, 2e") 
In Exercises 11—14, (a) use a computer to give a plot of the 
given path x over the indicated interval for t; identify the di- 


rection in which t increases. (b) Show that the path lies on the 
given surface S. 


Q 11. x(t) = B cost, 4sinzt, 21), —4 < t < 4; S is ellip- 


x? y? 
tical cylinder — + — = 1. 
ical cylinder 9 a 16 
@ 12. x(t) = (t cost, tsint, t), —20 < t < 20; S is cone 


z2 =x +y 


Calculate the velocity, speed, and acceleration of the paths Q 13. x(t) = (t sin 2t, t cos 2t, t°), —6 <t <6; S is para- 


given in Exercises 7—10. 


7. x(t) = Bt- 5i + (2t+7)j 


boloid z = x? + y?. 


@ 14. 


x(t) = (2cost, 2 sint, 3 sin 8t), 0 < t < 27; S is cy- 
linder x? + y? = 4. 


In Exercises 15—18, find an equation for the line tangent to the 
given path at the indicated value for the parameter. 


15. 
16. 
17. 
18. 
19. 


x(t) = tetit+te"j,r=0 

x(t) = 4costi—3sintj+5tk, t = 7/3 

x(t) = (P, t,t), t =2 

x(t) = (cos(e’), 3 — t°, t), t = 1 

(a) Sketch the path x(t) = (t, t° — 2t + 1). 

(b) Calculate the line tangent to x when t = 2. 

(c) Describe the image of x by an equation of the form 
y = f(x) by eliminating t. 

(d) Verify your answer in part (b) by recalculating the 
tangent line, using your result in part (c). 


Exercises 20-23 concern Roger Ramjet and his trajectory when 
he is shot from a cannon as in Example 6 of this section. 


20. 


21. 


22. 


23. 


24. 


Verify that Roger Ramjet’s path in Example 6 is indeed 
a parabola. 


Suppose that Roger is fired from the cannon with an 
angle of inclination 6 of 60° and an initial speed vp of 
100 ft/sec. What is the maximum height Roger attains? 


Suppose that Roger is fired from the cannon with an an- 
gle of inclination 0 of 60° and that he hits the ground 
1/2 mile from the cannon. What, then, was Roger’s 
initial speed? 


IfRoger is fired from the cannon with an initial speed of 
250 ft/sec, what angle of inclination 0 should be used 
so that Roger hits the ground 1500 ft from the cannon? 


Gertrude is aiming a Super Drencher water pistol at 

Egbert, who is 1.6 m tall and is standing 5 m away. 

Gertrude holds the water gun 1 m above ground at an 

angle a of elevation. (See Figure 3.14.) 

(a) If the water pistol fires with an initial speed of 
7 m/sec and an elevation angle of 45°, does Egbert 
get wet? 
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9 (b) If the water pistol fires with an initial speed of 


25. 


26. 


27. 


28. 


29. 
30. 


8 m/sec, what possible angles of elevation will 
cause Egbert to get wet? (Note: You will want to 
use a computer algebra system or a graphics cal- 
culator for this part.) 


A malfunctioning rocket is traveling according to the 
path x(t) = (e™, 3t? — 2t, t — +) in the hope of reach- 
ing a repair station at the point (7e*, 35, 5). (Here 
t represents time in minutes and spatial coordinates 
are measured in miles.) At t = 2, the rocket’s engines 
suddenly cease. Will the rocket coast into the repair 
station? 


Two billiard balls are moving on a (coordina- 

tized) pool table according to the respective paths 

x(t) = (? — 2, n = 1) and y(t) = (t, 5 — t?), where 

t represents time measured in seconds. 

(a) When and where do the balls collide? 

(b) What is the angle formed by the paths of the balls 
at the collision point? 


Establish part 1 of Proposition 1.4 in this section: If x 
and y are differentiable paths in R”, show that 


d b dx R dy 
x-y)=y- x-—. 
dt ae dt dt 
Establish part 2 of Proposition 1.4 in this section: If x 
and y are differentiable paths in R, show that 


n xXy+XxXXxX dy 
= — X FR 
dt i dt 


Prove Proposition 1.7. 


(a) Show that the path x(t) = (cost, cost sint, sin? t) 
lies on a unit sphere. 

(b) Verify that x(t) is always perpendicular to the ve- 
locity vector v(t). 


(c) Use Proposition 1.7 to show that if a differentiable 
path lies on a sphere centered at the origin, then 
its position vector is always perpendicular to its 
velocity vector. 


Figure 3.14 Figure for Exercise 24. 
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Consider the path 
x = (a + b cos wt) cos t 
y =(a+bcosa@t)sint, 


z = bsin wt 


where a, b, and w are positive constants and a > b. 


D (a) Use a computer to plot this path when 


32. 


33. 


34. 


i a=3,b=l1andwo=15. 
ii. a = 5, b = 1, and w = 15. 
iii. a = 5, b = 1, and w = 25. 


Comment on how the values of a, b, and w affect 
the shapes of the image curves. 


(b) Show that the image curve lies on the torus 


(Vx +y -af +27 =D". 
(A torus is the surface of a doughnut.) 


For the path x(t) = (e' cost, e' sin t), show that the an- 
gle between x(t) and x'(t) remains constant. What is 
the angle? 


Consider the path x: R > R?, x(t) = (t?, © — t). 
(a) Show that this path intersects itself, that is, that 
there are numbers ¢, and t such that x(t,) = x(t). 


(b) At the point where the path intersects itself, it 
makes sense to say that the image curve has two 
tangent lines. What is the angle between these tan- 
gent lines? 


Although the path x:[0,27]>R’, x(t)= 
(cost, sint) may be the most familiar way to give a 
parametric description of a unit circle, in this problem 
you will develop a different set of parametric equations 
that gives the x- and y-coordinates of a point on the 
circle in terms of rational functions of the parameter. 
(This particular parametrization turns out to be useful 
in the branch of mathematics known as number theory.) 

To set things up, begin with the unit circle x? + 
y? = 1 and consider all lines through the point (— 1, 0). 
(See Figure 3.15.) Note that every line other than the 


35. 


vertical line x = —1 intersects the circle at a point 
(x, y) other than (—1, 0). Let the parameter t be the 
slope of the line joining (—1, 0) and a point (x, y) on 
the circle. 


(1,0) 


Figure 3.15 Figure for Exercise 34. 


(a) Give an equation for the line of slope f joining 
(—1, 0) and (x, y). (Your answer should involve 
x, y, and t.) 


(b) Use your answer in part (a) to write y in terms of 
x and t. Then substitute this expression for y into 
the equation for the unit circle. Solve the resulting 
equations for x in terms of t. Your answer(s) for x 
will give the points of intersection of the line and 
the circle. 


(c) Use your result in part (b) to give a set of paramet- 
ric equations for points (x, y) on the unit circle. 


(d) Does your parametrization in part (c) cover the 
entire circle? Which, if any, points are missed? 


Let x(t) be a path of class C! that does not pass through 
the origin in R°. If x(tg) is the point on the image of x 
closest to the origin and x'(to) 4 0, show that the po- 
sition vector x(fo) is orthogonal to the velocity vector 
x’ (to). 


3.2 Arclength and Differential Geometry 


In this section, we continue our general study of parametrized curves in R, 
considering how to measure such geometric properties as length and curvature. 
This can be done by defining three mutually perpendicular unit vectors that form 
the so-called moving frame specially adapted to a path x. Our study takes us 
briefly into the branch of mathematics called differential geometry, an area where 
calculus and analysis are used to understand the geometry of curves, surfaces, 
and certain higher-dimensional objects (called manifolds). 


x(b) 


x(a) 
Figure 3.16 Approximating the 
length of a C! path. 
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Length of a Path 


For now, let x: [a,b] — R? be a C! path in R? Then we can approximate the 
length L of x as follows: First, partition the interval [a, b] into n subintervals. 
That is, choose numbers fo, t1, ..., tn such that a = tọ < ti <-:- < t, = b. If, 
fori = 1,...,n, we let As; denote the distance between the points x(t;—1) and 
x(t;) on the path, then 


Le > As;. (1) 
i=l 


(See Figure 3.16.) We have x(t) = (x(t), y(t), z(t)), so that the distance formula 
(i.e., the Pythagorean theorem) implies 


As; = (Ax? + Ay? + Az, 


where Ax; = x(t;) = X(t;-1), Ay; = y(ti) = y(ti—1), and Az; = z(ti) = z(ti—1). It 
is entirely reasonable to hope that the approximation in (1) improves as the At; ’s 
become closer to zero. Hence, we define the length L of x to be 


max At;—>0 


L= lim X} VAx;? + Ay? + Az’. (2) 
i=l 
Now, we find a way to rewrite equation (2) as an integral. On each subinterval 
[ti—1, ti], apply the mean value theorem (three times) to conclude the following: 
1. There must be some number t* in [t;—1, ti] such that 
x(ti) =e) = 2 OG = ta); 


that is, Ax; = x (YAK. 
2. There must be another number f;* in [t;—1, t;] such that 


Ayi = y (F) Ati. 
3. There must be a third number ¢t*** in [t;—1, t;] such that 
AZ = Ze )Ati. 


Therefore, with a little algebra, equation (2) becomes 


max At; 0 


L= lim fx? tyre + cry An. 3) 
i=l 


When the limit appearing in equation (3) is finite, it gives the value of the definite 
integral 


b 
| Vv? + ye + erat. 


Note that the integrand is precisely ||x’(t)||, the speed of the path. (This makes 
perfect sense, of course. Speed measures the rate of distance traveled per unit 
time, so integrating the speed over the elapsed time interval should give the total 
distance traveled.) Moreover, it’s not hard to see how we should go about defining 
the length of a path in R” for arbitrary n. 
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v 
x 
Figure 3.17 A C! path. 
x(t) x(t) 
x(b) 


x(a) 


Figure 3.18 A piecewise C! path 
x: [a, b] > RÈ. 


DEFINITION 2.1 The length L(x) ofa C! path x: [a, b] > R” is found by 
integrating its speed: 


b 
L(x) = / IX Oll dt. 


EXAMPLE 1 To check our definition in a well-known situation, we compute 
the length of the path 


x: [0,2m] > R?, x(t)=(acost,asint), a> 0. 
We have 
x(t) = —asinti+acostj, 


SO 


\|x’(t)|| = Va? sin? t + a2 cos?t =a. 


Thus, Definition 2.1 gives 
27 
L(x) = i adt =2ma. 
0 


Since the path traces a circle of radius a once, the length integral works out to be 
the circumference of the circle, as it should. + 


EXAMPLE 2 For the helix x(t) = (a cost, a sint, bt), 0 < t < 27, we have 
x(t)=—asinti+acostj+bk, 


so that ||x’(t)|| = va? + b?, and 
27 
L(x) = | va? + b? dt =2nvVa* +b’. 
0 


When b = 0, the helix reverts to a circle and the length integral agrees with the 
previous example. + 


Although we have defined the length integral only for C! (or “smooth- 
looking”) paths, there is no problem with extending our definition to the piecewise 
C! case. By definition, a C! path is one with a continuously varying velocity vec- 
tor, and so it typically looks like the path in Figure 3.17. A piecewise C! path is one 
that may not be C! but instead consists of finitely many C! chunks. A continuous, 
piecewise C! path that is not C! typically looks like the path in Figure 3.18. Each 
of the three portions of the path defined for (i) a < t < tı, Gi) ti < t < h, and 
Gii) t < t < b is ofclass C1, but the velocity, if nonzero, would be discontinuous 
att = tı and t = h. To define the length of a piecewise C! path, all we need do is 
break up the path into its C! pieces, calculate the length of each piece, and add to 
get the total length. For the piecewise C! path shown in Figure 3.18, this means we 
would take 


ti ty b 
[lvolas f iroja f [toar 


to be the length. 


P=x(t) 


Figure 3.19 The arclength 
reparametrization. 
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WARNING Even if a path is continuous, the definite integral in Definition 2.1 
may fail to exist. An example of such an unfortunate situation is furnished by the 
path x: [0, 1] > R’, 


1 
tsin- ift#0 
x(t) = (t, y(t)), where y(t)= t ; 
0 iff =0 


Such a path is called nonrectifiable. It is a fact that any C! path with endpoints 
is rectifiable, which is why we made such a condition part of Definition 2.1. 


The Arclength Parameter 


The calculation of the length of a path is not only useful (and moderately inter- 
esting) in itself, but it also provides a way for us to reparametrize the path with 
a parameter that depends solely on the geometry of the curve traced by the path, 
not on the way in which the curve is traced. 

Let x be any C! path and assume that the velocity x’ is never zero. Fix a point 
Po on the path and let a be such that x(a) = Po. We define a one-variable function 
s of the given parameter ¢ that measures the length of the path from Pp to any 
other (moving) point P by 


s(t) = if x(a) || dr. 


(See Figure 3.19. The Greek letter tau, t, is used purely as a dummy variable— 
the standard convention is never to have the same variable appearing in both the 
integrand and either of the limits of integration.) If t happens to be less than a, 
then the value of s in formula (4) will be negative. This is nothing more than a 
consequence of how the “base point” Pp is chosen. 

Here’s how to get the new parameter: From formula (4) and from the funda- 
mental theorem of calculus, 


Dk OP ies EE E sal 5 
Tifo rajko = speed. (5) 


Since we have assumed that x’(t) 4 0, it follows that ds/dt is nonzero. Hence, 
ds/dt is always positive, so s is a strictly increasing function of t. Thus, s is, 
in fact, an invertible function; that is, it is at least theoretically possible to solve 
the equation s = s(t) for t in terms of s. If we imagine doing this, then we can 
reparametrize the path x, using the arclength parameter s as independent variable. 


EXAMPLE 3 For the helix x(t) = (a cost, a sint, bt), if we choose the “base 
point” Pp to be x(0) = (a, 0, 0), then we have 


t ia 
=f old = f ya? + b?dt = ya? + b?t, 
0 0 
so that 


s = va? +b?t, 
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or 
sS 
Va +b? 
(What the preceding tells us is that this reparametrization just rescales the time 
variable.) Hence, we can rewrite the helical path as 


P 5 bs 
x(s) = (acos (+) .asin(—*_), *_). . 


EXAMPLE 4 The explicit determination of the arclength parameter for a given 
parametrized path is a delicate matter. Consider the path 


_ V2 5,135 
wna (Fs 4°). 


Then x'(t) = (1, /2t, t?) and, if we take the base point to be x(0) = (0, 0, 0), then 


t 
= f VvV1l4+2r2+1t4dt 
0 


t t t3 
=f Vatar = | a+ eyarar4 E. 
0 0 3 


On the other hand, the path y(t) = (t, t?, t°) is quite similar to x, yet it has 
no readily calculable arclength parameter. In this case, y’(t) = (1, 2t, 3t?) and the 
resulting integral for s(t) is 


t 
s(t) = / Vv¥14+4r?24 914 dt. 
0 


It can be shown that this integral has no “closed form” formula (i.e., a formula 
that involves only finitely many algebraic and transcendental functions). + 


The significance of the arclength parameter s is that it is an intrinsic param- 
eter; it depends only on how the curve itself bends, not on how fast (or slowly) 
the curve is traced. To see more precisely what this means, we resort to the chain 
rule. Consider s as an intermediate variable and ¢ as a final variable. Then we 
have 


d 
x(t) = vo by the chain rule, 
=x(s)|x(I|_ by (5). 
Since x’(t) Æ 0, we can solve for x’(s) to find 


t 
tos 
IIx’@)Il 

Therefore, x’(s) is precisely the normalization of the original velocity vector, and 
so itis aunit vector. Hence, the reparametrized path x(s) has unit speed, regardless 
of the speed of the original path x(t). (This result makes good geometric sense, 
too. If arclength, rather than time, is the parameter, then speed is measured in 
units of “length per length,” which necessarily must be one.) 

The only unfortunate note to our story is that the integral in formula (4) is 
usually impossible to compute exactly, thus making it impossible to compute s 
as a simple function of t. (The case of the helix is a convenient and rather special 


(6) 


Figure 3.20 A unit tangent 
vector. 
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exception.) One generally prefers to work indirectly, letting the chain rule come 
to the rescue. We shall see this indirect approach next. 


The Unit Tangent Vector and Curvature 
Let x: 7 C R > R? bea C? path and assume that x’ is never zero. 


DEFINITION 2.2 The unit tangent vector T of the path x is the normal- 
ization of the velocity vector; that is, 
v x’(t) 
Ivi xO 


We see from Definition 2.2 that the unit tangent vector is undefined when the 
speed of the path is zero. Also note that, from equation (6), T is dx/ds, where s 
is the arclength parameter. Geometrically, T is the tangent vector of unit length 
that points in the direction of increasing arclength, as suggested by Figure 3.20. 


EXAMPLE 5 For the helix x(t) = (a cost, a sint, bt), we have 


TO= x(t) _—asinti+acostj+bk 
Ix ©) va? + b? l 


On the other hand, if we parametrize the helix using arclength so that 


P 5 bs 
x(s) = («co (=) (=) TF) 


then 
T(s) =x(s) = — sin ( : Ji l os ( 5 )i 
Veto Yate)” late \ ate 
TE, 
This agrees (as it should) with the first expression for T, since s = va? + b? t, 
as shown in Example 3. + 


Using the unit tangent vector, we can define a quantity that measures how 
much a path bends as we travel along it. To do so, note the following key facts: 


PROPOSITION 2.3 Assume that the path x always has nonzero speed. Then 


1. dT/dt is perpendicular to T for all ¢ in J (the domain of the path x). 


2. |idT/dt|| |: equals the angular rate of change (as t increases) of the direc- 
tion of T when ft = fo. 


PROOF (You can omit reading this proof for the moment if you are interested in 
the main flow of ideas.) To prove part 1, we have 


T(t): T(t) = 1, 
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T(t) 


Figure 3.21 The vector triangle 
used in the proof of 
Proposition 2.3. 


since T is a unit vector. Hence, 
d 
—(T-T)=0, 
rr ) 


because the derivative of a constant is zero. Also we have 


by the product rule (Proposition 1.4). Thus, 


dT 
2T-— =0. 
dt 
Therefore, T is always perpendicular to dT /dt. (See Proposition 1.7.) 
Now we prove part 2. Because T is a unit vector for all t, only its direction 
can change as t increases. This angular rate of change of T is precisely 
AO 
im —, 
At>0+ At 
where A0 comes from the vector triangle shown in Figure 3.21. To make the 
argument technically simpler, we shall assume that AT 4 0. We claim that 


(7) 


lm —— = 
aro JATI] 


Then, from equation (7), 


_ AO : A0 ||AT]| i Ad . | AT 
lim — = lim — = lim —— —. 
Ar>0+ At ato ||AT]| At At>0+ ||AT|| Arsor At 
AT 
Hit 
At>0+ At 


Since Af is assumed to be positive in the limit, we may conclude that 
A0 AT dT 

im — = lim |—] = | — 

At>0+ At At>0+ || At dt 


as desired. 
To establish equation (7), the law of cosines applied to the vector triangle in 
Figure 3.21 implies 


ATI? = ITE + Ad? + ITI? — 2TH + ADI TOI cos Ad 
= 2 — 2 cos AQ, 


because T is always a unit vector. Thus, 


. A0 . A0 
lim —— = lim — 
At>0* ||AT]| Art 4/2 — 2 cos AO 
. A0 
lim 
At=>0+ 


[2 Xsin?(A0/2)) 
from the half-angle formula, and so 
A0 , A0/2 
im ——= lim —— =l, 
At—>0+ || AT] — Aror sin(A@/2) 


from the well-known trigonometric limit (or from L Hôpital’s rule). a 
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Part 2 of Proposition 2.3 provides a precise way of measuring the bending of 
a path. 


DEFINITION 2.4 The curvature « of a path x in R? is the angular rate of 
change of the direction of T per unit change in distance along the path. 


The reason for taking the rate of change of T per unit change in distance in the 
definition of « is so that the curvature is an intrinsic quantity (which we certainly 
want it to be). Figure 3.22 should help you develop some intuition about x. 


T 


T 


Figure 3.22 In the left figure, « is not large, since the 
path’s unit tangent vector turns only a small amount per 
unit change in distance along the path. In the right 
figure, « is much larger, because T turns a great deal 
relative to distance traveled. 


Because ||dT/dt|| measures the angular rate of change of the direction of T 
per unit change in parameter (by part 2 of Proposition 2.3) and ds /dt is the rate 
of change of distance per unit change in parameter, we see that 


a ear a 


(dT /dt| | dT 


ds ||’ 


where the last equality holds by the chain rule. It is formula (8) that we will use 
when making calculations. 


EXAMPLE 6 For the circle x(t) = (a cost, a sint), 0 < t < 27, 
ds 


x(t) = —asinti+acostj, IXO = r~ 
so that 
TO x(t) nti re 
= =-—sintl+cosfJ. 
xoll 
Hence, 
a L costi — sint j|| = : 
~ ds/dt a uag 


Thus, we see that the curvature of a circle is always constant with value equal 
to the reciprocal of the radius. Therefore, the smaller the circle, the greater the 
curvature. (Draw a sketch to convince yourself.) + 
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EXAMPLE 7 Ifa and b are constant vectors in R? and a Æ 0, the path 


x(t) =at+b 
traces a line. We have 
x(t) =a, 
so 
a _ tal 
ae 
Hence, 
T(t) = ESSN 
llall 


which is a constant vector. Thus, T’(t) = 0 and formula (8) implies immedi- 
ately that k is zero, which agrees with the intuitive fact that a line doesn’t 
curve. r 


EXAMPLE 8 Returning to our friend the helix 


x(t) = (a cost, a sint, bt), 


we have already seen that 


= =a? +b and T(t)= ~a sintit+ acostj+bk 


dt [ae +b? 
Thus, formula (8) gives 
_ 1 —acosti—asintj} a 
ar +B? Ja? +b? | eae 


We see that the curvature of the helix is constant, just like the circle. In fact, as b 
approaches zero, the helix degenerates to a circle, and the resulting curvature is 
consistent with that of Example 6. 

We can also compute the curvature from the parametrization given by arc- 
length. The same helix is also described by 


F s bs 
x(s) = (« cos (+=). oon (= + z) va? + =) 


and we have 


dx a i sS , a S , 
To= pn yare (seep) ereh (eee)! 
+ — k 
We can, therefore, compute 
dT 


a S ; a : S : 
ds ~ ae (laze)! aan (Tae) h 


and hence, from formula (8), that 


dT a 
E= — = = zi 
ds a? + b? 
which checks. + 
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The Moving Frame and Torsion 


We now introduce a triple of mutually orthogonal unit vectors that “travel” with a 
given path x: Z > R?, known as the moving frame of the path. (Note: In general, 
the term “frame” means an ordered collection of mutually orthogonal unit vectors 
in R”.) These vectors should be thought of as a set of special vector “coordinate 
axes” that move from point to point along the path. 

To begin, assume that (i) x’(t) Æ 0 and (ii) x’(t) x x(t) Æ 0 for all ¢ in 7. 
(The first condition assures us that x never has zero speed and the second that x 
is not a straight-line path.) Then the first vector of the moving frame is just the 
unit tangent vector: 

dx x(t) 
ds ||x’(a)ll 
(Now you see why condition (i) is needed.) For a second vector orthogonal to T, 
recall that part 1 of Proposition 2.3 says that dT/dt must be perpendicular to T. 
Hence, we define 


dT /dt 


\|dT/dt\| 


(That dT/dt is not zero follows from assumptions (1) and (11).) The vector N is 
called the principal normal vector of x. By the chain rule, N is also given by 


_ aT/ds 
ldT/ds|| 
Since x = ||dT/ds || by formula (8), we also see that 


(10) 


At a given point P along the path, the vectors T and N (and also the vectors 
x’ and x”) determine what is called the osculating plane of the path at P. (See 
Figure 3.23.) This is the plane that “instantaneously” contains the path at P. (More 


x 


Osculating plane 


Figure 3.23 The osculating plane of the path x at the 
point P. 
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precisely, it is the plane obtained by taking points P; and P on the path near P 
and finding the limiting position of the plane through P, Pı, and P) as P, and 
P, approach P along x. The word “osculating” derives from the Latin osculare, 
meaning “to kiss.”) 

Now that we have defined two orthogonal unit vectors T and N, we can 
produce a third unit vector perpendicular to both: 


B=TxN. (12) 


The vector B, called the binormal vector, is defined so that the ordered triple 
(T, N, B) is a right-handed system. Thus, B is a unit vector since 


. IE 
PB = EIN sit S te ta 


EXAMPLE 9 For the helix x(t) = (a cost, a sint, bt), the moving frame vec- 
tors are 


—asinti+acostj+bk 


/ a2 + b2 


T(t) = 


(as we have already seen), 


T(t) _ (-acosti—asintj)/Va* + b* 


N(t) = Tol > a/v Le = —costi-—sintj, 
and 
i j k 
B(t)=TxN=| —asint/ Va +b  acost/Vaz2+b? b/Va2? +b? 
— cost —sint 0 


b b a 
————  sint ]}i-— | ——~cost j+ ——— |k. 
(ae ) (Jae )s (Jaze) i 


Equation (11) says that the derivative of T (with respect to arclength) is a 
scalar function (namely, the curvature) multiple of the principal normal N. This 
is not surprising, since N is defined to be parallel to the derivative of T. A more 
remarkable result (see the addendum at the end ofthis section) is that the derivative 
of the binormal vector is also always parallel to the principal normal; that is, 


dB ; 
~F (scalar function) N. 
s 


The standard convention is to write this scalar function with a negative sign, so 
we have 


The scalar function t thus defined is called the torsion of the path x. Roughly 
speaking, the torsion measures how much the path twists out of the plane, how 
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“three-dimensional” x is. Note that, according to our conventions, the curvature 
kK is always nonnegative (why?), while t can be positive, negative, or zero. 


EXAMPLE 10 Consider again the case of circular motion. Thus, let x(t) = 
(a cost, a sin t). Then, as shown in Example 6, 


'(t 1 
T(t) = x) =-—sinti+costj, and «= |Z =— 
IIx) a 
Now we calculate 
T(t) ae ae 
N= 7 = — costi — sint j, 
IIT) 


B=TxN=k, aconstant vector. 


Hence, dB/ds = 0, so there is no torsion. This makes sense, since a circle does 
not twist out of the plane. + 


EXAMPLE 11 Let x(t) = (e cost, e' sint, e’). We calculate T, N, and B and 
identify the curvature and torsion of x. 
To begin, we have 
x(t) _ æ (cost — sint)i + e'(cost + sint)j + e k 


T(t) = xol — J3et 


1 
= ((cost — sin t)i + (cost + sint)j+k). 
WE ( dit ( j4 

From this, we may compute 
dT dT/dt zint + cost)i+ (cost — sint) j) 
ds ds/dt Je 


=f 


Lins + cost)i+ (cost — sin tj), 


so that the curvature is 


me ae 
Now we determine the remainder of the moving frame: 
TO = L (Gsint + c0s1)i + (eost — sinn) j) 
So = sint + cos t)i + (cost — sin t) j), 
ITO] v2 
1 
B = T x N = —((sint — cos t)i — (sint + cost) j + 2k). 
V6 


Finally, to find the torsion, we calculate 
dB dB/dt (cost + sint)i + (sint — cost) j) 
ds  ds/dt Jet 


((cost + sint)i-+ (sint — cost) j) 


sO e 
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Figure 3.24 Any vector in the 
plane perpendicular to T can be 
used for N. 


EXAMPLE 12 Ifa and b are vectors in R?, then the straight-line path x(t) = 
at + b has, as we saw in Example 7, T = a/|lal|. Thus, bothdT/dt anddT/ds are 
identically zero. Hence, x = 0 (as shown in Example 7) and N cannot be defined 
using formula (9). From geometric considerations, any unit vector perpendicular 
to T can, in principle, be used for N. (See Figure 3.24.) If we choose one such 
vector, then B can be calculated from formula (12). Since T, N, and B are all 
constant, tT must be zero. This is an example of a moving frame that is not 
uniquely determined by the path x and serves to illustrate why the assumption 
x’ x x” Æ 0 was made. + 


It is important to realize that the moving frame, curvature, and torsion are 
quantities that are intrinsic to the curve traced by the path. That is, any parame- 
trized path that traces the same curve (in the same direction) must necessarily 
have the same T, N, B vector functions and the same curvature and torsion. This 
is because all of these quantities can be defined entirely in terms of the intrinsic 
arclength parameter s. (See Definition 2.2 and formulas (6), (8), (10), (11), (12), 
and (13).) 

Another important fact is that the curvature function « and the torsion function 
t together determine all the geometric information regarding the shape of the 
curve, except for the curve’s particular position in space. To be more precise, we 
have the following theorem, whose proof we omit: 


THEOREM 2.5 Let s be the arclength parameter and suppose Cı and C3 are 
two curves of class C? in R*. Assume that the corresponding curvature functions 
kı and kz are strictly positive. Then if «1(s) = k2(s) and t1(s) = Ta(s), the two 
curves must be congruent (in the sense of high school geometry). In fact, given 
any two continuous functions « and t, where «(s) > 0 for all s in the closed 
interval [0, L], there is a unique curve parametrized by arclength on [0, L] (up to 
position in space) whose curvature and torsion are « and T, respectively. 


Tangential and Normal Components of Velocity and Accel- 
eration; Other Curvature Formulas 


As we have seen, the moving frame provides us with an intrinsic set of vectors, 
like coordinate axes, that are special to the particular curve traced by a path. In 
contrast, the velocity and acceleration vectors of a path are definitely not intrinsic 
quantities but depend on the particular parametrization chosen as well as on the 
shape of the path. (The speed of a path is entirely independent of the geometry 
of the curve traced.) We can get some feeling for the relationship between the 
intrinsic notion of the moving frame and the extrinsic quantities of velocity and 
acceleration by expressing the latter two vector functions in terms of the moving 
frame vectors. 

Thus, we begin with a C? path x: Z > R? having x’ Æ 0 and x’ x x” 40. 
For notational convenience, let å denote ds/dt and § denote d*s/dt*. From 
Definition 2.2, we know that T = v/||v|| and so, since the speed $ = ds/dt = ||v\l, 
we have 


Figure 3.25 Decomposition of 
acceleration a into tangential and 
normal components. 
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This formula says that the velocity is always parallel to the unit tangent vector, 
something we know well. To obtain a similar result for acceleration, we can 
differentiate (14) and apply the product rule: 


d dT 
t)=v(t) = —(6T) = ST +5—. 15 
a(t) = v(t) ae re, (15) 
Next, we express dT/dt in terms of the T, N, B frame. Formula (11) gives 


the derivative of dT/ds in terms of N. The chain rule says that dT/ds = 
(dT/dt)/(ds/dt). Thus, from formula (11), we have 


Hence, we may rewrite equation (15) as 


a(t) = ST + «s°N. 


WARNING § = d’s/dt? is the derivative of the speed, which is a scalar function. 
The acceleration a is the derivative of velocity and so is a vector function. 


Note that formula (16) shows that the acceleration has no component in the 
direction of the binormal vector B. Therefore, both velocity and acceleration are 
vectors that lie in the osculating plane of the path. (See Figure 3.25.) 

At first glance, it may not appear to be especially easy to use formula (16) 
to resolve acceleration into its tangential and normal components because of the 
curvature term. However, 


lall? = a-a = GT + K?N) - (5T + «PN) = $° + (Ks?Y, 


since T and N are perpendicular vectors. Consequently, we may calculate the 
components as follows: 


Tangential component of acceleration = atang = 5. 


Normal component of acceleration = anorm = Ke = x ilal? — toe 


EXAMPLE 13 Let x(t) = (t, 2t, t°). Then v(t) = i+ 2j + 2tk and a(t) = 2k. 
We have $ = ||v(t)|| = V5 + 422. Therefore, 


R 4t 
dtang =s= J5 42 
Since ||a|| = 2, we see that 
162? 2/5 
norm = lall? = Gaus a ? 


5F4 S5 FAR 


Formulas (14) and (16) enable us to find an alternative equation for the 
curvature of the path. We simply calculate that 


v xa = (ST) x (ST + «8?N) = $5(T x T) + «8°(T x N) = ks°B. 
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Recalling that $ = ||v||, we have, by taking magnitudes, 
Iv x all = «llv[)° IBI] = «llvil’, 


since B is a unit vector. Thus, 


_ iv xall 


Ivi? 


This relatively simple formula expresses the curvature (an intrinsic quantity) 
in terms of the nonintrinsic quantities of velocity and acceleration. 


EXAMPLE 14 For the path x(t) = (277 + 1, t+, t°), we have 
v(t) = 6f7i + 4j + 5t*k 
and 
a(t) = 12ti + 1217j + 200°k. 


You can check that 


Ivii = t2,/25¢4 + 1612 + 36 


and 


Iv x al] = |4¢4(517i — 15tj + 6k) || = 4r*,/2514 + 22512 + 36. 
Therefore, formula (17) yields 


_ lv x all _ 4(2514 + 22512 + 36)!/2 
~ v £2(25t4 + 16t? + 36)3/2 ” 


which is certainly a more convenient way to determine curvature in this case. ® 


Summary 


You have seen many formulas in this section, and, at first, it may seem difficult 
to sort out the primary statements from the secondary results. We list the more 
fundamental facts here: 


For a path x: ] > R°: 


Nonintrinsic quantities: 


Velocity v(t) = x’(t). 
d 
Speed — = IVI. 


Acceleration a(t) = x” (t). 


Figure 3.26 The arclength 
function. 


Figure 3.27 w(s)=aT + 
bN + cB. 
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Arclength function: (See Figure 3.26.) 


SOS iP \|x'(t)||dt (basepoint is Py = x(a)) 


Intrinsic quantities: 
The moving frame: 


i dx x(t) 
Unit tangent vector T = — = : 
ds |Ix’(o)ll 
eee dT/ds dT/dt 
Principal normal vector N = = : 
|dT/ds||_ |\dT/dt|| 
Binormal vector B = T x N. 


dT 
ds 


_ lldT/4t|| 


Curvature k = | AOE 


dB 
Torsion tT is defined so that ve = —tN. 
5 


Additional formulas: 
VOO SSS speed). 
a(t) = §T+x«s?N (šis derivative of speed). 


Iv x all 
IIvil? 


Addendum: More About Torsion and the 
Frenet-Serret Formulas 


We now derive formula (13), the basis for the definition of the torsion of a curve. 
That is, we show that the derivative of the binormal vector B (with respect to 
arclength) is always parallel to the principal normal N (i.e., that dB/ds is a 
scalar function times N). The two main ingredients in our derivation are part | of 
Proposition 2.3 and the product rule. 

We begin by noting that, since the ordered triple of vectors (T, N, B) forms a 
frame for R?, any moving vector, including dB/ds, can be expressed as a linear 
combination of these vectors; that is, we must have 


“ = a(s)T + b(s)N + c(s)B, (18) 
where a, b, and c are appropriate scalar-valued functions. (Because T, N, and 
B are mutually perpendicular unit vectors, any (moving) vector w in R? can be 
decomposed into its components with respect to T, N, and B in much the same 
way that it can be decomposed into i, j, and k components—see Figure 3.27.) To 
find the particular values of the component functions a, b, and c, it turns out that 
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we can solve for each function by applying appropriate dot products to equation 
(18). Specifically, 


dB 
ae >T=a(s)T-T+ D(s)N-T + c(s)B-T 
s 
=a(s)-1+bd(s)-0+ c(s)-0 
= a(s), 
and, similarly, 
dB dB 
2N =b B= cls). 
N=), B= cl) 


From Proposition 1.7, dB/ds is perpendicular to B and, hence, c must be zero. 
To find a, we use an ingenious trick with the product rule: Because T - B = 0, it 
follows that d/ds(T - B) = 0. Now, by the product rule, 


La. B)=T- dB m dT p 
~ ds ds 
Consequently, (dB/ds)-T = —(dT/ds)-+B. Thus, 
B 
a(s) = dB _aT -B 
ds ds 
= —-KN-B by formula (11), 
=0, 
and equation (18) reduces to 
dB 
— = b(s)N. 
L (s) 


No further reductions are possible, and we have proved that the derivative of B is 
parallel to N. The torsion t can, therefore, be defined by t(s) = —b(s). 

Formulas (11) and (13) gave us intrinsic expressions for dT/ds and dB/ds, 
respectively. We can complete the set by finding an expression for dN/ds. The 
method is the same as the one just used. Begin by writing 


“ = a(s)T + b(s)N + c(s)B, (19) 


where a, b, and c are suitable scalar functions. Taking the dot product of equation 
(19) with, in turn, T, N, and B, yields the following: 


A dN aN- 
°T, b(s) = —-N, — 
a(s) = (=> ce(s) = = 
The “product rule ae used here then reveals that 
dN dT 
= -T=—_N.- 
ays) ds ds 
= —N-kKN by formula (11) 
cas aes 
and 
N dB 
c(s) = -B=-N.- 
ds 


= —N-(-tN) by formula (13) 


=T. 


3.2 Exercises 


Calculate the length of each of the paths given in Exercises 
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Moreover, we may differentiate the equation N- N = 1 to find 


dN dN 
b(s) = -N=-—N- È 
(s) ds ds 


which implies that b(s) is zero. Hence, equation (19) becomes 


dN 
— = —KT 4+ TB. 


ds 
The formulas for dT/ds, dN/ds, and dB/ds are usually taken together as 
T(s)=KN 
N'(s)= —-xkT+7TB 
B'(s)= —tN 


and are known as the Frenet—Serret formulas for a curve in space. They are so 
named for Frédéric-Jean Frenet and Joseph Alfred Serret, who published them 
separately in 1852 and 1851, respectively. The Frenet—Serret formulas give a 
system of differential equations for a curve and are key to proving a result like 
Theorem 2.5. They are often written in matrix form, in which case, they have an 
especially appealing appearance, namely, 


T 0 Kk 0 T 
N |=|- 0 r N 
B’ 0 =r 0 B 


for the length of the curve y = f(x) between (a, f(a)) 


1-6. and (b, f(b)). 
1. x(t) = 2t+1,7-3t),-l1<t<2 11. Use Exercise 10 or Definition 2.1 (or both) to calculate 
. . the length of the line segment y = mx + b between 
32 2 3/2 
ae =o ee eke (Xo, yo) and (x1, y1). Explain your result with an ap- 
3. x(t) = (cos 3r, sin 3t, 2£?),0 <t <2 propriate sketch. 
4. x(t)=7it+tjt+ ?k,1<t<3 12. (a) Calculate the length of the line segment deter- 
mined by the path 
5. x(t) = (£, 327, 6t) —1 <t <2 
t) = (aıt + bi, aat + b 
6. x(t) = (ln (cost), cost, sint), % <t < 3 ees Piya E 
as t varies from to to fy. 
7. x(t) = (Int, 17/2, /2t), 1 <t <4 weer ae l 
(b) Compare your result with that of Exercise 11. 
= 7 2 

8. x(t) = (2t cost, 2t sint, 2V21),0 <1 <3 (c) Now calculate the length of the line segment deter- 
9. The path x(t) = (a cos? t, a sin? t), where a is a posi- mined by the path x(t) = at + b as ¢ varies from 

tive constant, traces a curve known as an astroid or a to to ty. 

hypocycloid of four cusps. Sketch this curve and find 13. This problem concerns the path x = |t — l]it |r|j, 

its total length. (Be careful when you do this.) tape? 

10. If f is a continuously differentiable function, show (a) Sketch this path. 
how Definition 2.1 may be used to establish the (b) The path fails to be of class C! but is piecewise 
formula C!. Explain. 
b (c) Calculate the length of the path. 
= / 2 
L= / 1+ (FŒ) dx 14. Consider the path x(t) = (e™* cost, e™ sint). 
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(a) Argue that the path spirals toward the origin as 
t > +00. 


(b) Show that, for any a, the improper integral 


/ IIx(|| dt 


(c) Interpret what the result in part (b) says about the 
path x. 


converges. 


Suppose that a curve is given in polar coordinates by 
an equation of the form r = f(0), where f is of class 
C!. Use Definition 2.1 to derive the formula 


p 
L =| vV f'(OY + fey do 


for the length of the curve between the points ( f (œ), a) 

and (f(8), 6) (given in polar coordinates). 

(a) Find the arclength parameter s = s(t) for the path 
x(t) = e” cos bti+e sinbtj + e” k. 


(b) Express the original parameter t in terms of s and, 
thereby, reparametrize x in terms of s. 


Determine the moving frame {T, N, B}, and compute the cur- 
vature and torsion for the paths given in Exercises 17—20. 


17. 
18. 


19. 
20. 
21. 


22. 


x(t) = 5cos3ti+ 6t j + 5sin3tk 


x(t) = (sint — t cos t)i + (cost + t sint)j + 2k, 
t>0 


x(t) = (t, i0 + 197, 1a -tP -1 <t <1 
x(t) = (e” sint, e” cost, 1) 


(a) Use formula (17) in this section to establish the 
following well-known formula for the curvature 
of a plane curve y = f(x): 

c IFO 
I+ OG) ll 


(Assume that f is of class C?.) 


(b) Use your result in (a) to find the curvature of 


y = ln (sin x). 


(a) Let x(s) = (x(s), y(s)) be a plane curve para- 
metrized by arclength. Show that the curvature is 
given by the formula 


n ‘lk 


c= |x! y” —x"y 


path x and, separately, plot the curvature « as a function of t 
over the indicated interval for t and value(s) of the constants. 


Q 23. 
Q 24. 


Q 25. 


Q 26. 


x(t)=(acost,bsint), O<t<2m; a=2,b=1 


x(t) = (2a(1 + cost) cost, 2a(1+ cosf)sint), 0< 


t<2n; a=] 


x(t) = (2acost(1 + cost) — a, 2a sin t(1 + cos t)), 
0<t<27; a=1 


x(t) = (asinnt,bsinmt), O<t<2m; a=3, 


b=2,n=4,m=3 


Find the tangential and normal components of acceleration for 
the paths given in Exercises 27—32. 


27. 
28. 
29. 
30. 
31. 
32. 
33. 


34. 


35. 


36. 


37. 


(b) Show thatx(s) = ($(1 —s?), (cos! s —sVT=s?)) 


is parametrized by arclength, and compute its 
curvature. 


In Exercises 235-26, (a) use a computer algebra system to cal- 
culate the curvature k of the indicated path x and (b) plot the 


38. 


x(t)=ti+tj 

x(t) = (2t, e”) 

x(t) = (e cos2t, e' sin 2t) 

x(t) = (4 cos 5t, 5 sin 4t, 3t ) 

x(t) = (t, t, t°) 

x(t) = 2(1 —cost)i+sintj+ + costk 

(a) Show that the tangential and normal compo- 


nents of acceleration atang aNd anorm satisfy the 
equations 


Ix’ xx" 


X °X 
dtang = | x’ | ’ norm = | x’ | . 


(b) Use these formulas to find the tangential and 
normal components of acceleration for the path 
x(t) = (t +2)i +t j+3tk. 

Use Exercise 33 to show that, for the plane curve 

y= f(x), 

FOF") 
Atang = E ea 
v1 +E) 
[œ| 
norm = Begom i 
v1 +E) 
Establish the following formula for the torsion: 
_ (vx a)-a’ 
lyxa? 

Show that xt = —T' + B’, where differentiation is with 

respect to the arclength parameter s. 

Show that if x is a path parametrized by arclength and 

x’ x x” Æ 0, then 

Ket = (x’ x x”) ex". 

Suppose x: Z —> R? is a path with x'(t) x x” (t) £ 0 for 

allt € I. The osculating plane to the path at t = tọ is 


the plane containing x(f)) and determined by (i.e., par- 
allel to) the tangent and normal vectors T(tọ) and N(fo). 


The rectifying plane at ft = tọ is the plane contain- 

ing x(fo) and determined by the tangent and binormal 

vectors T(fo) and B(fo). Finally, the normal plane at 

t = tọ is the plane containing x(fo) and determined by 

the normal and binormal vectors N(tọ) and B(to). Note 

that both the osculating and rectifying planes may be 
considered to be tangent planes to the path at fo since 
they are both parallel to T(to). 

(a) Show that B(tọ) is perpendicular to the osculating 
plane at fo, that N(tọ) is perpendicular to the rec- 
tifying plane at fo, and that T(fo) is perpendicular 
to the normal plane at fo. 


(b) Calculate the equations for the osculating, rec- 
tifying, and normal planes to the helix x(t) = 
(a cost, a sint, bt) at any to. (Hint: To speed your 
calculations, use the results of Example 9.) 
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As a result, we can arrange T, N, and B in a circle so 
that they correspond, respectively, to the vectors i, j, 
k appearing in Figure 1.54 and so that we may use a 
mnemonic for identifying cross products that is similar 
to the one described in Example 1 of §1.4. 


Let x be a path of class C*, parametrized by arclength s, 
with x’ x x" + 0. We define the Darboux rotation vector (also 
called the angular velocity vector) by 


w= tT+xB. 


Note that w(so) is parallel to the rectifying plane to x(so). The 
direction of the Darboux vector w gives the axis of the “screw- 
like” motion of the path x and its length gives the angular 
velocity of the motion. Exercises 43—45 concern the Darboux 


vector. 

39. Recall that the equation for a sphere of radius a > 0 43. Show that ||w|| = vK? + T7. (Hint: The vectors T, N 
and center xọ may be written as ||x — Xo|| = a. (See and B are pairwise orthogonal.) a 
Example 15 of §2.1.) Explain why the image of a path 
x with the property that 44. (a) Use the Frenet—Serret formulas to establish the 

(x(t) _ Xo) 3 (x(t) _ Xo) = a Darboux formulas: 
for all ¢ must lie on a sphere of radius a. T=wxT 

40. Letx be a path with x’ x x” 4 0 and suppose that there 
is a point Xo that lies on every normal plane to x. Show N =wxN 
that the image of x lies on a sphere. (See Exercise 38 g 

; B =wxB. 
concerning normal planes to paths.) 

41. Use the result of Exercise 40 to show that x(t) = (b) Use the Darboux formulas to establish the Frenet- 
(cos 2r, — sin 2t, 2 cost) lies on a sphere by showing Serret formulas. Hence the two sets of equations 
that (1, 0, 0) lies on every normal plane to x. are equivalent. (Hint: Use Exercise 42.) 

42. Use the result of Exercise 27 of §1.4 to show that 45. Show that x is a helix if and only if w is a constant 

NxXB=T and BxT=N. vector. (Hint: Consider w’ and use Theorem 2.5.) 
3.3 Vector Fields: An Introduction 
We begin with a simple definition. 
DEFINITION 3.1 A vector field on R” is a mapping 
y F: X CR" > R’. 
F(x) 


Ce Fe 
File se leila 


Figure 3.28 The constant vector 
field F(x) = i + j. 


We are concerned primarily with vector fields on R? or R°. In such cases, we 
adopt the point of view that a vector field assigns to each point x in X a vector 
F(x) in R”, represented by an arrow whose tail is at the point x. This perspective 
allows us to visualize vector fields in a reasonable way. 


EXAMPLE 1 Suppose F: R? — R? is defined by F(x) = a, where a is a con- 
stant vector. Then F assigns a to each point of R?, and so we can picture F by 
drawing the same vector (parallel translated, of course) emanating from each point 
in the plane, as suggested by Figure 3.28. + 
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Gea CE) EXAMPLE 2 Let's depict G: R? > R?, G(x, y) = yi — xj. We can begin to do 
this by calculating some specific values of G, as in the adjacent table. However, 
(0, 0) 0 it is difficult to get much of a feeling for G as a whole in this way. To understand 
T y =J G somewhat better, we need to “play around” a bit. Note that 
, i 
1,1 i-j F í 
ei 2 IGG, yl = lyi — xjl = Vy? 2? = Irl, 


where r = xi + yj, the position vector of the point (x, y). From this observation, 
it follows that G has constant length a on the circle x? + y? = a’. In addition, 
y we have 


r: G(x, y) = xi + yj): i — xj) = 0. 


Hence, G(x, y) is always perpendicular to the position vector of the point (x, y). 
These facts, together with a table like the preceding one, make it possible to see 
that G looks like Figure 3.29. + 


REMARK Sometimes a scalar-valued function f: X C R” —> R is called a scalar 
field. One thinks of a vector field on R” as attaching vector information (such 
as wind velocity) to each point and a scalar field as attaching real number infor- 
mation (such as temperature or pressure). We’ll use the term “scalar field” only 
occasionally, but we don’t want to shock you when we do. 


Figure 3.29 The vector field 
G(x, y) = yi — xj of Example 2. 


EXAMPLE 3 Letr = xi + yj + zk. The so-called inverse square vector field 
in R? is a function F: R? — {0} —> R? given by 


C 
F(x, y, z) = ——<Yr, 
Irl 
where c is any (nonzero) constant. Ifthe term “inverse square” seems inappropriate 
to you, we’ll try to convince you otherwise. Set u = r/||r|| so that r = ||r|/u. Then 


F is given by 
c c r c 
F(x, y,z) = r= ( ) ( ) = u. (1) 
Irl? Irl? / \rll IIr ||? 


Therefore, F is a vector field whose direction at the point P(x, y, z) Æ (0, 0, 0) 
z is parallel to the vector from the origin to P and whose magnitude is inversely 
proportional to the square of the distance from the origin to P. Note that F points 
away from the origin if c is positive and toward the origin if c is negative. 
We have seen an example of an inverse square field in §3.1—namely, the 
Newtonian gravitational field between two bodies. If one of the bodies is at the 
origin and the other at (x, y, z), then we have 


GMm 
——u 
II ||? 


In this case, the proportionality constant c is -GMm, which is negative. This 
means that the gravitational force is attractive (1.e., it points in the direction 
Figure 3.30 An inverse square that reduces the distance between the two bodies). Such a vector field is shown in 
vector field. Figure 3.30. An example ofa repelling inverse square field is the electrostatic force 
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between two particles with like static charges (both positive or both negative). 
This force is expressed by Coulomb’s law, 


k 
F= ae 
lirl 


’ 


where r is the vector from particle 1 (at the origin) to particle 2, u = r/||r||, 
qı and q2 are the respective charges (positive or negative) on the particles, and 
k is a constant appropriate for the units being used. In mks units, distance is 
measured in meters, charge in coulombs, force in newtons, so that k is equal to 
8.9875 x 10? Nm?/C?. e 


Gradient Fields and Potentials - 


Inverse square fields are interesting not only for their origin in basic physical 
situations, but also because they are examples of gradient fields. A gradient field 
on R” is a vector field F: X C R” —> R” such that F is the gradient of some 
(differentiable) scalar-valued function f: X — R. That is, 


F(x) = V f(x) 


at all x in X. The function f is called a (scalar) potential function for the vector 
field F. To see what this means in the case of the inverse square field (1), we write 
out the components of F explicitly: 


c c xi + yj + zk 
F= —J u= , 
Ir|| x+y? +2277 \ /y2 4 y2 4 22 
since r = xi + yj + zk and u = r/||r||. That is, 
CX oe cy P CZ k 
(x2 + y2 + 22)3/2 Lai (x2 + y2 + 72)3/2 Jo (x2 + y2 + 72)3/2 i 


We leave itto you to check that F(x, y, z) = V f (x, y, z), where f: R? — {0} > R 
is given by 


F(x, y, z) = 


ë _¢€ 
Vx? py +e Irl 


REMARK In physics and engineering, a negative sign is often introduced in the 
definition of a potential function (i.e., so that a potential function g for a vector 
field F is one such that F = —V g). The motivation behind such a convention is 
that in physical applications, it is desirable to have the potential function rep- 
resent potential energy in some sense. For example, in the case of the gravita- 
tional field F = —(GMm/||r||*)u, a physicist would take the potential function to 
be —GMm_/|r||, not +GMm/||r|| as we do. The advantage to the physicist in 
doing so is that the physicist’s potential function increases with increasing ||r||. 
This corresponds to the notion that the greater the distance between two bodies, 
the greater should be the stored gravitational potential energy. 


fœ, y,z)= 


From Theorem 6.4 of Chapter 2 we know that the gradient of any C! scalar- 
valued function f: X C R” — R is perpendicular to the level sets of f. Thus, if 
F is a gradient vector field on R”, F(x) must be perpendicular to the level set of a 
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Isotherms 


Figure 3.31 A gradient vector field F = V f. Equipotential 
lines are shown where f is constant. 


potential function of F containing the point x. If f is such a potential function, the 
level set {x | f(x) = c} is called an equipotential set (or equipotential surface if 
n = 3, or equipotential line if n = 2) of the vector field F. (See Figure 3.31.) 

You’ve seen examples of equipotential lines every time you’ve looked at a 
weather map. Usually curves of constant barometric pressure (called isobars) or 
of constant temperature (isotherms) are drawn. (See Figure 3.32.) Perpendicular 
to such equipotential lines are associated gradient vector fields that point in the 
direction of most rapid increase of pressure or temperature. 


Flow Lines of Vector Fields 


When you draw a sketch of a vector field on R? or R, it is easy to imagine 
that the arrows represent the velocity of some fluid moving through space as in 
Figure 3.33. It’s natural to let the arrows blend into complete curves. What you’re 


PARTLY 
SUNNY 


Figure 3.32 A weather map. (Weather graphics courtesy of Accuweather, Inc. 385 Figure 3.33 A fluid moving 
Science Park Road, State College, PA 16803. (814) 237-0309. © 2011. Used with through space. 


permission.) 


Lo Vector field F 


Figure 3.34 A flow line. 


= 


Figure 3.35 The vector field 
F(x, y, z) = 2i — 3j + k of 
Example 4. 


Figure 3.36 Flow lines of 
F(x, y) = —yi + xj of Example 5. 
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doing analytically is drawing paths whose velocity vectors coincide with those of 
the vector field. 


DEFINITION 3.2 A flow line ofa vector field F: X C R” — R” is a differ- 
entiable path x: Z — R” such that 


x(t) = F&A). 


That is, the velocity vector of x at time t is given by the value of the vector 
field F at the point on x at time t. (See Figure 3.34.) 


EXAMPLE 4 We calculate the flow lines of the constant vector field 
F(x, y, z) = 2i — 3j + k. 

A picture of this vector field (see Figure 3.35) makes it easy to believe that 
the flow lines are straight-line paths. Indeed, if x(t) = (x(t), y(t), z(t)) is a flow 
line, then, by Definition 3.2, we must have 


xX(D = K'O, y0, z) = 2, —3, 1) = Fae). 


Equating components, we see 


x'(t)=2 
y) = -3. 
z(t) = 1 
These differential equations are readily solved by direct integration; we obtain 
x(t) = 2t + xo 
y(t) = —3t + yo, 
z(t) =t+ zo 


where xo, yo, and Zo are arbitrary constants. Hence, as expected, we obtain para- 
metric equations for a straight-line path through an arbitrary point (xo, yo, Zo) 
with velocity vector (2, —3, 1). + 


EXAMPLE 5 Your intuition should lead you to suspect that a flow line of the 

vector field F(x, y) = — yi + xj should be circular as shown in Figure 3.36. Indeed, 

if x: [0, 277) —> R? is given by x(t) = (a cost, a sin t), where a is constant, then 
x (t) = —asinti+acostj = F(a cost, a sint), 

so such paths are indeed flow lines. 


Finding all possible flow lines of F(x, y) = —yi+ xj is a more involved task. 
If x(t) = (x(t), y(t)) is a flow line, then, by Definition 3.2, we must have 


x(t) =x Hity O = —YOi+ xj = FX). 
Equating components, 
fee = —y(t) 
YO=x) | 


This is an example of a first-order system of differential equations. It turns out 
that all solutions to this system are of the form 


x(t) = (a cost — b sint, a sint + b cost), 
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where a and b are arbitrary constants. It’s not difficult to see that such paths trace 
circles when at least one of a or b is nonzero. + 


In general, if F is a vector field on R”, finding the flow lines of F is equivalent 
to solving the first-order system of differential equations 
xO) = Filt), x26), - - - , Xn(t)) 
x(t) = Fo(x1 (t), x2(t), ---, Xn(t)) 


y(t) = Fn(xi(@), x2(t), -+> Xn E) 


for the functions x;(f), ..., X,(t) that are the components of the flow line x. (The 
function F; is just the ith component function of the vector field F.) Such a 
problem takes us squarely into the realm of the theory of differential equations, a 


fascinating subject, but not of primary concern at the moment. 


3.0 Exercises 


In Exercises 1—6, sketch the given vector fields on R?. 


1. F= yi— xj 
2. F=xi- yj 
3. F = (—x, y) 
4. F = (x, x’) 
5. F = (x?, x) 
6. F=(y’, y) 


In Exercises 7-12, sketch the given vector field on RÈ. 
7. F=3i+ 2j+k 
8. F = (y, —x, 0) 
9. F = (0, z, —y) 
10. F = (y, —x, 2) 
11. F = (y, —x, z) 
y : x 


i j 
(Orr re yty ae 


Zz 
< 


+ ——=— k 
af ee eye FZ 
In Exercises 13-16, use a computer to plot the given vector 
fields over the indicated ranges. 


Q 13. F=(x-y,x+y); -l<x<1,-l<y<l 
Q 14. F=0%x,x°y); -2<x<2, -2<y<2 


> 15. F = (x sin y, ycosx); —2r <x < 2r, 
—2n < y < 2x 


@ 16. F = (cos (x — y), sin(x + y); —20 < x < 2x, 
—2n < y < 2x 


12. F= 


In Exercises 17—19, verify that the path given is a flow line of 
the indicated vector field. Justify the result geometrically with 
an appropriate sketch. 


17. x(t) = (sint, cost, 0), F = (y, —x, 0) 
18. x(t) = (sint, cost, 2t), F = (y, —x, 2) 
19. x(t) = (sint, cost, e”), F = (y, —x, 2z) 
In Exercises 20-22, calculate the flow line x(t) of the given 


vector field F that passes through the indicated point at the 
specified value of t. 


20. F(x, y)=—xi+ yj; x(0)= (2,1) 
21. Fx, y) = 0, y); x(1) =(1,e) 
22. F(x, y,z) = 2i—3yj+2k; x(0)= (3,5,7) 
23. Consider the vector field F = 3i — 2j + k. 
(a) Show that F is a gradient field. 


(b) Describe the equipotential surfaces of F in words 
and with sketches. 


24. Consider the vector field F = 2x i + 2y j — 3k. 
(a) Show that F is a gradient field. 


(b) Describe the equipotential surfaces of F in words 
and with sketches. 


25. If x is a flow line of a gradient vector field F = V f, 
show that the function G(t) = f(x(t)) is an increasing 
function of t. (Hint: Show that G'(t) is always non- 
negative.) Thus, we see that a particle traveling along 
a flow line of the gradient field F = V f will move 
from lower to higher values of the potential function 
f. That’s why physicists define a potential function of 
a gradient vector field F to be a function g such that 
F = —V¢g (i.e., so that particles traveling along flow 
lines move from higher to lower values of g). 
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LetF: X C R" — R’ bea continuous vector field. Let (a, b) be 27. Verify that 


an intervalin R that contains 0. (Think of (a, b) as a “time inter- 
val.”) A flow of F is a differentiable function ġ: X x (a,b) > 


R” ofn + 1 variables such that 


$: R? x R> R’, 


(x, y, t) = (y sint + x cost, y cost — x sint) 


EAT N t) =F); x, 0) =x is a flow of the vector field F(x, y) = (y, —x). 
ot l , : 


Intuitively, we think of (x, t) as the point at time t on the flow 
line of F that passes through x at time 0. (See Figure 3.37.) 


28. Verify that 
¢:R? x R > R?, 


Thus, the flow of F is, in a sense, the collection of all flow lines $x, y, z, t) = (x cos 2t — y sin 2t, y cos 2t 
of F. Exercises 26-31 concern flows of vector fields. : j 


+x sin2t, ze‘) 


is a flow of the vector field F(x, y,z) = —2yi+ 


FO, 1) 2xj— zk. 


/ Í (x, t) S 29. Show that if ¢: X x (a, b) > R” is a flow of F, then, 


Z 


¢(x, 0) =x 4 


gra 


for a fixed point xo in X, the map x: (a, b) > R” given 
by x(t) = (x9, t) is a flow line of F. 


30. If @ is a flow of the vector field F, explain why 
lox, t), s) = (x, s + t). (Hint: Relate the value of 
the flow ¢ at (x, t) to the flow line of F through x. You 


Figure 3.37 The flow of the vector field F. may assume the fact that the flow line of a continuous 


26. Verify that 
o:R? x R> R’, 


vector field at a given point and time is determined 
uniquely.) 


31. Derive the equation of first variation for a flow of a 
vector field. That is, if F is a vector field of class C! 


b(x, y, t) = ( ; Lap We = y eT, with flow ¢ of class C?, show that 
ə 
HERI E a) ZD, 1) = DFN, 1) DxG(X, 1). 
—— e+ —e 
2 2 Here the expression “D,@(x, t)? means to differentiate 
f ith t to th iabl ,X2,...,Xp, that is, 
sabroe ae oe a a 


3.4 Gradient, Divergence, Curl, and the Del 
Operator 

In this section, we consider certain types of differentiation operations on vector 
and scalar fields. These operations are as follows: 

1. The gradient, which turns a scalar field into a vector field. 

2. The divergence, which turns a vector field into a scalar field. 

3. The curl, which turns a vector field into another vector field. (Note: The curl 

will be defined only for vector fields on R3.) 


We begin by defining these operations from a purely computational point of view. 
Gradually, we shall come to understand their geometric significance. 


The Del Operator 


The del operator, denoted V, is an odd creature. It leads a double life as both 
differential operator and vector. In Cartesian coordinates on R3, del is defined by 
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the curious expression 


The “empty” partial derivatives are the components of a vector that awaits suitable 
scalar and vector fields on which to act. Del operates on (1.e., transforms) fields 
via “multiplication” of vectors, interpreted by using partial differentiation. 

For example, if f: X C R? > R is a differentiable function (scalar field), 
the gradient of f may be considered to be the result of multiplying the vector V 
by the scalar f, except that when we “multiply” each component of V by f, we 
actually compute the appropriate = derivative: 


Ce o ee Ofc Əf. OF 
Vf(x,y¥,2) = [i +j ea oe le 
Ox By | Ox ay dz 
The del operator can also be defined in R”, for arbitrary n. If we take 
X1,X2,...,X, to be coordinates for R”, then del is simply 


where e; = (0,...,1,...,0),i = 1,..., n, is the standard basis vector for R”. 


The Divergence of a Vector Field 


Whereas taking the gradient of a scalar field yields a vector field, the process of 
taking the divergence does just the opposite: It turns a vector field into a scalar 
field. 


DEFINITION 4.1 Let F:X C R” — R” be a differentiable vector field. 
Then the divergence of F, denoted div F or V- F (the latter read “del dot 


F”), is the scalar field 
OF OF: OF, 
diwi = Yo = — E e soo do LE, 
Ox 0x2 Ox, 
where x|,...,X, are Cartesian coordinates for R and F\,..., F,, are the 


component functions of F. 


It is essential that Cartesian coordinates be used in the formula of Definition 4.1. 
(Later in this section we shall see what div F looks like in cylindrical and spherical 
coordinates for R°.) 


EXAMPLE 1 If F = x?yi + xzj + xyzk, then 


; ð ð ð 
div F = —(x’y)4 (xz) 4 (xyz) = 2xy + 0 + xy = 3xy. e 
ax dy dz 


\ 


ge, a ge 
Pet elie, a 


Figure 3.38 The vector field 
F = xi+ yj of Example 2. 


NV 
AN 


Figure 3.39 The vector field 
G = —xi — yj of Example 2. 
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The notation for the divergence involving the dot product and the del operator 
is especially apt: If we write 


F= Fie, + Foe +. of + Fren, 


then, 
0 0 0 
V-F=(e + ez b+ ++, - (Fie; + Foer + +--+ Fren) 
Ox, 0X2 OXn 
_ OF; i a Fa i i OF, 
7 Chal 0x2 i i OXn 


where, once again, we interpret “multiplying” a function by a partial differential 
operator as performing that partial differentiation on the given function. 

Intuitively, the value of the divergence of a vector field at a particular point 
gives a measure of the “net mass flow” or “flux density” of the vector field in 
or out of that point. To understand what such a statement means, imagine that 
the vector field F represents velocity of a fluid. If V - F is zero at a point, then 
the rate at which fluid is flowing into that point is equal to the rate at which 
fluid is flowing out. Positive divergence at a point signifies more fluid flowing out 
than in, while negative divergence signifies just the opposite. We will make these 
assertions more precise, even prove them, when we have some integral vector 
calculus at our disposal. For now, however, we remark that a vector field F such 
that V - F = 0 everywhere is called incompressible or solenoidal. 


EXAMPLE 2 The vector field F = xi + yj has 


ð ð 
V.F = —(x) + —(y) =2. 
Ox dy 


This vector field is shown in Figure 3.38. At any point in R?, the arrow whose 
tail is at that point is longer than the arrow whose head is there. Hence, there is 
greater flow away from each point than into it; that is, F is “diverging” at every 
point. (Thus, we see the origin of the term “divergence.”) 

The vector field G = —xi — yj points in the direction opposite to the vector 
field F of Figure 3.38 (see Figure 3.39), and it should be clear how G’s divergence 
of —2 is reflected in the diagram. + 


EXAMPLE 3 The constant vector field F(x, y, z) = a shown in Figure 3.40 
is incompressible. Intuitively, we can see that each point of R? has an arrow 
representing a with its tail at that point and another arrow, also representing a, 
with its head there. 

The vector field G = yi — xj has 


ð ð 
v= ONA ay x) =0. 


A sketch of G reveals that it looks like the velocity field of a rotating fluid, without 
either a source or a sink. (See Figure 3.41.) + 


The Curl of a Vector Field 
If the gradient is the result of performing “scalar multiplication” with the del 
operator and a scalar field, and the divergence is the result of performing the 
“dot product” of del with a vector field, then there seems to be only one simple 
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Figure 3.40 The constant vector 
field F = a. 


< 


<="? 


A 
D 


Figure 3.41 The vector field 
G = yi — xj resembles the 
velocity field of a rotating fluid. 


differential operation left to be built from del. We call it the curl of a vector field 


and define it as follows: 


F”), is the vector field 


DEFINITION 4.2 Let F: X C R? > R? be a differentiable vector field on 
R? only. The curl of F, denoted curl F or V x F (the latter read “del cross 


0 
dy 
i j 
=| hie aay 
F, Fy 


ð ð 
culF=VxF=([(i + j tk x (Fii + Foj + F3k) 
Ox dz 


0/0z 


_ (eis 0h ia 
~ \ ay Oz i 


in OES Vay Cra or k 
dz Ox Ja Ox dy 


There is no good reason to remember the formula for the components of the 
curl—instead, simply compute the cross product explicitly. 


EXAMPLE 4 IfF = x?yi — 2xzj + (x + y —2)k, then 
i j k 

VxF= | d/dx a/dy 0/dz 
xy —2xz x+y-z 


3 a ə ə 
= (Że ee) al 2x2) (Fun ape ty a)i 


ð a 
(= 2x2) * (wy) k 


= (1+ 2x)i— j — (x? + 2z)k. 
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Figure 3.42 A twig in a pond where water moves with velocity given by a vector field F. In the left figure, the twig does 
not rotate as it travels, so curl F = 0. In the right figure, curl F ¥ 0, since the twig rotates. 


One would think that, with a name like “curl,’ V x F should measure how 
much a vector field curls. Indeed, the curl does measure, in a sense, the twisting 
or circulation of a vector field, but in a subtle way: Imagine that F represents the 
velocity of a stream or lake. Drop a small twig in the lake and watch it travel. 
The twig may perhaps be pushed by the current so that it travels in a large circle, 
but the curl will not detect this. What curl F measures is how quickly and in what 
orientation the twig itself rotates as it moves. (See Figure 3.42.) We prove this 
assertion much later, when we know something about line and surface integrals. 
For now, we simply point out some terminology: A vector field F is said to be 
irrotational if V x F = 0 everywhere. 


EXAMPLE 5 Let F = (3x?z + y?)i + 2xy j + (x? — 2z)k. Then 
i j k 
VxF=| ə3/əx a/dy 0/dz 
3x°z + y? 2xy x? = 2z 


0 ð 0 d 
- (Že 2z) (xy) ) i (Forcty) ze 22)) j 


ə ð 
| (Zew pe? DL 


= (0 — 0)i+ (Bx? — 3x”)j + (2y — 2y)k = 0. 


Thus, F is irrotational. + 


Two Vector-analytic Results 
It turns out that the vector field F in Example 5 is also a gradient field. Indeed, 
F = V f, where f(x, y, z) = xz + xy? — z?. (We’ll leave it to you to verify 
this.) In fact, this is not mere coincidence but an illustration of a basic result 
about scalar-valued functions and the del operator: 


THEOREM 4.3 Let f: X C R? —> R be of class C?. Then curl (grad f) = 0. 
That is, gradient fields are irrotational. 
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PROOF Using the del operator, we rewrite the conclusion as 
Vx(Vf)=0, 


which might lead you to think that the proof involves nothing more than noting 
that V f is a “scalar” times V, hence, “parallel” to V, so that the cross product must 
be the zero vector. However, V is not an ordinary vector, and the multiplications 
involved are not the usual ones. A real proof is needed. 

Such a proof is not hard to produce: We need only start calculating V x (V f). 
We have 


Therefore, 
i j k 
Vx(Vf)=| 0/dx a/day 0/dz 
af/ax odf/dy  əƏf/ðz 


PF PFN, rf PFN, oF Pf )k 
= 1 | 
ðyðz  dzdy azəx  3xəz)/! \dxdy  dydx 
Since f is of class C*, we know that the mixed second partials don’t depend 


on the order of differentiation. Hence, each component of V x (V f) is zero, as 
desired. ia 


There is another result concerning vector fields and the del operator that is 
similar to Theorem 4.3: 


THEOREM 4.4 Let F: X C R? > R? be a vector field of class C?. Then 
div (curl F) = 0. That is, curl F is an incompressible vector field. 


The proof is left to you. 


EXAMPLE 6 If F = (xz — e™ cosz)i— yzj +e” (sin y + 2sinz)k, then 


ð 0 ð 
V.F= = (xz — ee cos z) 4 ay! yz) + zz (e™(sin y + 2 sinz)) 


= z — 2e** cosz — z + 2e** cosz = 0 
for all (x, y, z) € R?. Hence, F is incompressible. We’ll leave it to you to check 


that F = V x G, where G(x, y, z) = e” cos yi + e” sin z j + xyz k, so that, in 
view of Theorem 4.4 the incompressibility of F is not really a surprise. + 


Other Coordinate Formulations (optional) 


We have introduced the gradient, divergence, and curl by formulas in Cartesian 
coordinates and have, at least briefly, discussed their geometric significance. Since 
certain situations may necessitate the use of cylindrical or spherical coordinates, 
we next list the formulas for the gradient, divergence, and curl in these coordinate 
systems. Before we do, however, a remark about notation is in order. Recall that 
in cylindrical coordinates, there are three unit vectors e,, eg, and e, that point in 
the directions of increasing r, 0, and z coordinates, respectively. Thus, a vector 
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field F on R? may be written as 
F= F,e, + Foes + Fez. 


In general, the component functions F,, Fy, and F, are each functions of the 
three coordinates r, 0, and z; the subscripts serve only to indicate to which of 
the vectors e,, eg, and e, that particular component function should be attached. 
Similar comments apply to spherical coordinates, of course: There are three unit 
vectors €p, €y, and eg, and any vector field F can be written as 


F = Fre, + Fyeg + Foes. 


THEOREM 4.5 Let f:X C R? > R and F:Y C R? — R? be differentiable 
scalar and vector fields, respectively. Then 


af. laf, af 


V = f H T Z9 3 
G apt a 6) 
; 1f a dFy ə 
div F = F,)4 | Fy 4 
as r Fg ) 00 an J a) 
l e, reg e€ 
curl F = z| a/ar ð/Ə@  ə3/ðz | (5) 


F, r Fo F: 


PROOF We’ll prove formula (4) only, since the argument should be sufficiently 
clear so that it can be modified to give proofs of formulas (3) and (5). The idea is 
simply to rewrite all rectangular symbols in terms of cylindrical ones. 
From the equations in (8) of §1.7, we have 
e = cosĝ i + sinô j 
eg = — sin ĝ i + cos 0 j. (6) 
e =k 
From the chain rule, we have the following relations between rectangular and 
cylindrical differential operators: 


ð 0 : 0 
— =cosd— + sind— 
or ox dy 


a 
= —r sin Hr cos 
Ox 0 


00 
da 
az az 
These relations can be solved algebraically for 0/dx, 0/dy, and 0/0z to yield 
a sind ð 
or r 06 


a 

= ie (7) 
dy or r 00 
ə 

əz 
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Hence, we can use (6) and (7) to rewrite the expression for the divergence of a 
vector field on R: 


ð 0 a 
V-F= i+ jṣ4 k | - (Fe, + Fgeo + F:ez) 
Ox dy Oz 


, ð sind ð .{. 9 cos@ a a 
= |i | cos 8 + j | sind tk 
or r 06 or r 06 az 


- [((F, cos — Fy sin@)i+ (F, sin + Fg cos 8) j + F; kK]. 


(We used the equations in (7) to rewrite the partial operators 0/dx, 0/dy, and 
d/0z appearing in del and the equations in (6) to replace the cylindrical basis 
vectors e,, €, and e, by expressions involving i, j, and k.) Performing the dot 
product and using the product rule yields 


ð sin ð . 
V.F = |cos0— — — — ] (F, cos — Fy sin) 
or r 90 


. 90 cosé ð f ð 
+ {sin — + (F, sin0 + Fg cos 0) + — F; 
or r 00 Oz 


or 


r 


OF, ð sind OF, ð 
= cos ð | cos + F, gr C 0) cos 0 HF, T (cos 0) 
r 


ə 
Foz (sin 0) 


OF, 3 in 0 aF 
~cos6 (sind —* + Fy—(sin6)) + ~~ (sina —° 
or or 


r 


OF, a, 0/(. OF, a, 
+sin@ (sine + F.2(sine) = (sino HF, Z (ino) 
r 


or r 00 
+ sin@ (cos oS + Fo Tpu 0) + = (cos 0 + Fo ae 0) 
or or r 00 00 
MLE 
əz 


After some additional algebra, we find that 


. 2 2 

0 0 

sin“ 0 + cos )e 
r 


OF, 
V -F = (cos? 0 + sin’ 6) 5 ( 
F 


00 Oz 


| (pees) OFy , OF; 
i r 


OF, 1, | 13Fọ , aF, 


as desired. E 


In spherical coordinates, the story for the gradient, divergence, and curl is 
more complicated algebraically, although the ideas behind the proofare essentially 
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the same. We state the relevant results and leave to you the rather tedious task of 
verifying them. 


THEOREM 4.6 Let f:X ¢ R? > R and F: Y C R? = R? be differentiable 
scalar and vector fields, respectively. Then the following formulas hold: 


of 1 ol a 1 of 


Vf= e, 4 | - €9; 8 
f ap ° | pay © psing a0 | (8) 
1 ə 0 1 d Fo 
V-F= ——(p°F,) + —— —(sing F) + —— -—; 0 
p? ap ° o) p sing TA 9 Fo) p sing 90 ©) 
ep Pep pP Singe 
1 
VxF= su | 0/0p  3/ðğ 0/00 . (10) 
p? sing 
F, pFy pp sing Fo 
3.4 Exercises 
Calculate the divergence of the vector fields given in Exer- y 
cises 1-6. 
i ee a a A TI TEZ 
— 42° 2s ae 2s Sy ICD oe EEE Loe 
k Eaei 2 R= yitr] SE hoy eR ee 
P Se NS NOt {eH A Ke oH 
3. F=(x+yi+ O +zj+(x+z)k SN wow KN tte Sf FF ota 
4. F =zcos(e” jit xV/22 + lj + e” sin3xk preach A 
5. F = xe; +2x3e2 +--+ + nxen ee ee 
6. F = xie; + 2x162 +--+: +x e, AN EEN Parea 
Find the curl of the vector fields given in Exercises 7—11. T f ; i Ì i i : h : on 
J4 , ALLIR IANS AN 
7. F= xfi- xe j+2xyzk 


8. F = xi + yj + zk 


Figure 3.43 Vector field for Exercise 13(a). 


9. F = (x + yz)i + (y + xz)j + (z + xy)k 


10. F = (cos yz — x)i + (cos xz — y)j + (cos xy — z)k 


11. F= y zi+ e j+ x? yk 


y 

12. (a) Consider again the vector field in Exercise 8 and EE EEN ae ee 

its curl. Sketch the vector field and use your pic- SANSA Wa | ee ee ee 

ture to explain geometrically why the curl is as you See ae ea |e On eer a 

calculated. ESSE Jipa igis 

(b) Use geometry to determine V x F, where F = ae aa 
(xi + yj + zk) 3 x 

(PEP EE PERS A E 

(c) For F as in part (b), verify your intuition by explic- eee pres 

itly computing V x F. EOC SES) TRE SN Seas 

PE EPL FE DR 1 \ NENNE» 

"E E E AE E E a a a a a 


13. Can you tell in what portions of R’, the vector fields 


shown in Figures 3.43—3.46 have positive divergence? 


Negative divergence? 


Figure 3.44 Vector field for Exercise 13(b). 
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SA ROR ee er 
i a a A SN E d 
na SA mo e e a 
naie ss 8 ee a 
St a a ee er 
es oe ii we ee ee 
Xx 
=r- - -=-= =- =-= 
rr Sy ee 
awa ee -`s 
m a a m E 
PERET SR Se Se Se 


a ae oe ee ee ee a ee ec a a S 


aX Ree SS | ew ee ew we we 


Figure 3.46 Vector field for Exercise 13(d). 


14. Check that if f(x, y, z) = x? sin y + y? cosz, then 
V x(Vf)=0. 


15. Check that if F(x, y, z) = xyzi — æ cos xj + xy?z*k, 
then 


V-(V x F)=0. 
16. Prove Theorem 4.4. 


In Exercises 17-20, letr = xi + yj + z kand letr denote ||r||. 
Verify the following: 


17. Vr" =nr"?r 
18. V(Inr) = — 
7 
19. V. (r"r)= (n+ 3)r” 
20. V x(r"r) =0 


In Exercises 21—25, establish the given identities. (You may 
assume that any functions and vector fields are appropriately 


differentiable.) 
21. V-(F+G)=V-F+V:°:G 


22. Vx(F+G)=VxF+VxG 

23. V. (JF) = fV-F+F-Vf 

24. Vx(fF)=fVxF+VfxF 

25. V-(FxG)=G-VxF-F-VxG 
26. Prove formulas (3) and (5) of Theorem 4.5. 


27. Establish the formula for the gradient of a function in 
spherical coordinates given in Theorem 4.6. 


28. The Laplacian operator, denoted V?, is the second- 
order partial differential operator defined by 


2 o ane 

=" y ae’ 

(a) Explain why it makes sense to think of V? as V - V. 
(b) Show that if f and g are functions of class C?, then 


W(fe=fWetev f +2UY f Vg). 
(c) Show that 
V-(fVg—sVf)= fV’8 -8V f. 
29. Show that V - (fV f) = |V fI? + fV? f. 


30. Show that V x (V x F) = V(V-F)—V’°F. (Here 
V?F means to take the Laplacian of each component 
function of F.) 


Let X be an open set in R”, F: X C R” — R” a vector field 
on X, and a € X. Ifv is any unit vector in R", we define the 
directional derivative of F at a in the direction of v, denoted 
DyF(a), by 


D,F(a) = lim LFG + hv) — F(a)), 


provided that the limit exists. Exercises 31—34 involve direc- 
tional derivatives of vector fields. 


31. (a) In analogy with the directional derivative of a 
scalar-valued function defined in §2.6, show that 


d 
D,F(a) = ae + tv) 
t=0 


(b) Use the result of part (a) and the chain rule to show 
that, if F is differentiable at a, then 


D,F(a) = DF(ajv, 


where v is interpreted to be ann x 1 matrix. (Note 
that this result makes it straightforward to calculate 
directional derivatives of vector fields.) 


32. Show that the directional derivative of a vector field 
F is the vector whose components are the directional 


33. 


True/False Exercises for Chapter 3 


derivatives of the component functions F\,..., Fa of 
F, that is, that 
D,F(a) = (D,F (a), D, F(a), e.’ Dy F,(a)). 


LetF = yzi + xzj + xy k. Find Di-j+o/ FG. 2; 1): 
(Hint: See Exercise 31.) 


. Ifa path x remains a constant distance from the origin, 


then the velocity of x is perpendicular to x. 


. Ifa path is parametrized by arclength, then its velocity 


vector is constant. 


. Ifa path is parametrized by arclength, then its velocity 


and acceleration are orthogonal. 


d j 
-zg FOIS KOM. 


(x X y) Ham Ma 
Te amen” gy YS 
dT dB 
= | — It] — 
dt ds 


8. The curvature « is always nonnegative. 


9. The torsion t is always nonnegative. 


10. N= wld 
N= zy 
11. Ifa path x has zero curvature, then its acceleration is 


12. 


13. 


14. 
15. 
16. 


Miscellaneous Exercises for Chapter 3 


1. 


always parallel to its velocity. 


Ifa path x has a constant binormal vector B, then t = 0. 


d?s\* as\* 
(33) +e (2) = |la(s)||?. 


grad f is a scalar field. 
div F is a vector field. 


curl F is a vector field. 


Figure 3.47 shows the plots of six paths x in the plane. 
Match each parametric description with the correct 
graph. 

(a) x(t) = (sin 2t, sin an 


( 
(b) x(t) = (t + sin a t? + cos 6t) 
(c) x(t) =(? +1, P-t) 
(d) x(t) = (2t + n 4t, t — sin 5t) 
© D= t-t, t-t) 


(f) x(t) = (sin G A sin 3+), cos t) 


34. 


17. 
18. 
19. 
20. 


21. 


22. 


23. 


24. 
25. 
26. 
27. 
28. 


29. 


30. 
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Let F = xi + yj + z k. Show that D,F(a) = v for any 
point a € R? and any unit vector v € R?. More gener- 
ally, if F = (x1, x2, . . . , Xn), a = (41, @2, . . . , An), and 
v = (v1, v2, .. . , Vn), show that D,F(a) = v. 


grad(div F) is a vector field. 

div(curl(grad f)) is a vector field. 

grad f x div F is a vector field. 

The path x(t) = (2 cost, 4 sint, t) is a flow line of the 


vector field F(x, y, z) = Fit 2xj+zk. 


The path x(t) = (e' cost, e' (cost + sint), e' sint) isa 
flow line of the vector field F(x, y, z) = (x — z)i + 
2xj+ yk. 


The vector field F = 2xy cos zi — y? cos zj + e® kis 
incompressible. 


The vector field F = 2xy cos zi — y? coszj +e” kis 
irrotational. 


V x (V f) = 0 for all functions f: R? > R. 
If V -F = 0 and V x F = 0, then F = 0. 
V. (F xG)=F.(V x G)+G+(V xF). 
If F = curl G, then F is solenoidal. 


The vector field F= 2x sin y cos z i + x? cos y cos z 
j+x°sin ysinzk is the gradient of a function f of 
class C?. 


There is a vector field F of class C? on R? such that 
V x F = x cos? yi + 3yj — xyz’ k. 


If F and G are gradient fields, then F x G is incom- 
pressible. 


. Figure 3.48 shows the plots of six paths x in R3. Match 


each parametric description with the correct graph. 
(a) x(t) = (t + cos 3r, t? + sin 5t, sin 4t) 

(b) x(t) = (2 cos? t, 3 sin? t, cos 2t) 

(c) x(t) = (15 cost, 23 sint, 4t) 

(d) x(t) = (cos 3t, cos 5t, sin 4t) 

(e) x(t) = (2t cost, 2t sint, 4t) 

O x(t) = (7 +1, P -t,t — 27) 


238 Chapter 3 | Vector-Valued Functions 


Figure 3.47 Figures for Exercise 1. 
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Figure 3.48 Figures for Exercise 2. 


3. Suppose that x is a C? path with nonzero velocity. Show 
that x has constant speed if and only if its velocity and 
acceleration vectors are always perpendicular to one 
another. 


4. You are at Vertigo Amusement Park riding the new 
Vector roller coaster. The path of your car is given by 


wt mt 
x(t) = | e'/° cos —, et/® sin —, 
© ( 30 30 


2t(10 — t)(t — r) 


80 + 106 
where t = 0 corresponds to the beginning of your 
three-minute ride, measured in seconds, and spatial 
dimensions are measured in feet. It is a calm day, but 
after 90 sec of your ride your glasses suddenly fly off 
your face. 

(a) Neglecting the effect of gravity, where will your 
glasses be 2 sec later? 


(b) What if gravity is taken into account? 


5. Show that the curve traced parametrically by 


( s 1 ) 
x(t) = | cos(t — 1), t ae 2 


is tangent to the surface x? + y? + z? — xyz = 0 when 
t=1. 


6. Gregor, the cockroach, is on the edge of a Ferris wheel 
that is rotating at a rate of 2 rev/min (counterclock- 
wise as you observe him). Gregor is crawling along 
a spoke toward the center of the wheel at a rate of 
3 in/min. 

(a) Using polar coordinates with the center of the 
wheel as origin, assume that Gregor starts (at time 
t = 0) atthe pointr = 20 ft, 0 = 0. Give paramet- 
ric equations for Gregor’s polar coordinates r and 
@ at time t (in minutes). 


(b) Give parametric equations for Gregor’s Cartesian 
coordinates at time t. 


(c) Determine the distance Gregor has traveled once 
he reaches the center of the wheel. Express your 
answer as an integral and evaluate it numerically. 


If you have used a drawing program on a computer, you have 
probably worked with a curve known as a Bézier curve.” Such 
a curve is defined parametrically by using several control 
points in the plane to shape the curve. In Exercises 7—12, 
we discuss various aspects of quadratic Bézier curves. These 
curves are defined by using three fixed control points (x1, y1), 
(x2, y2), and (x3, y3) and a nonnegative constant w. The Bézier 
curve defined by this information is given by x:[0, 1] > R?, 
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x(t) = (x(t), y(t), where 


(A — txi + 2wt( — dx + x3 


any (A — t} +2wt(l1— t) +t? 
», O<tK<l. 
w= (= tyi + 2we(1 — tyz + ty 
a A-t} +2wt(1—1)+ 2? 
(1) 


@ 7. 
@ 8. 


10. 


11. 


12. 


Let the control points be (1,0), (0,1), and (1, 1). 
Use a computer to graph the Bézier curve for w = 
0, 1/2, 1,2, 5. What happens as w increases? 


Repeat Exercise 7 for the control points (—1, —1), 
(1, 3), and (4, 1). 


. (a) Show that the Bézier curve given by the paramet- 


ric equations in (1) has (x1, yı) as initial point and 
(x3, y3) as terminal point. 

(b) Show that x($) lies on the line segment joining 
(x2, y2) to the midpoint of the line segment joining 
(x1, y1) to (x3, y3). 


In general the control points (x1, y1), (x2, y2), and 
(x3, y3)will form a triangle, known as the control poly- 
gon for the curve. Assume in this problem that w > 0. 
By calculating x’(0) and x’(1), show that the tangent 
lines to the curve at x(0) and x(1) intersect at (x2, y2). 
Hence, the control triangle has two of its sides tangent 
to the curve. 


In this problem, you will establish the geometric sig- 
nificance of the constant w appearing in the equations 
in (1). 

(a) Calculate the distance a between x( $) and (x2, y2). 


(b) Calculate the distance b between x(4) and the 
midpoint of the line segment joining (x1, y1) and 
(x3, y3). 

(c) Show that w = b/a. By part (b) of Exercise 9, 
x(4) divides the line segment joining (x2, y2) to 
the midpoint of the line segment joining (x1, y1) 
to (x3, y3) into two pieces, and w represents the 
ratio of the lengths of the two pieces. 


Determine the Bézier parametrization for the portion 

of the parabola y = x? between the points (—2, 4) and 

(2, 4) as follows: 

(a) Two of the three control points must be (—2, 4) and 
(2, 4). Find the third control point using the result 
of Exercise 10. 


(b) Using part (a) and Exercise 9, we must have that 
x(5) lies on the y-axis and, hence, at the point 


? P Bézier was an automobile design engineer for Renault. See D. Cox, J. Little, and D. O’Shea, Ide- 
als, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative 
Algebra, 3rd ed. (Springer-Verlag, New York, 2007), pp. 28-29. Exercises 7—11 adapted with permission. 
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(0, 0). Use the result of Exercise 11 to determine 
the constant w. 

(c) Now write the Bézier parametrization. You should 
be able to check that your answer is correct. 


13. Let x: (0, 7) > R? be the path given by 
x(t) = (sint, cost + Intan 5), 

where f is the angle that the y-axis makes with the 

vector x(t). The image of x is called the tractrix. (See 

Figure 3.49.) 

(a) Show that x has nonzero speed except when t = 
w/2. 

(b) Show that the length of the segment of the tangent 
to the tractrix between the point of tangency and 
the y-axis is always equal to 1. This means that 
the image curve has the following description: Let 
a horse pull a heavy load by a rope of length 1. 


i 


0.5 1 


—0.5 


=l 


Figure 3.49 The tractrix 
of Exercise 13. 


Suppose that the horse initially is at (0, 0), the load 
at (1, 0), and let the horse walk along the y-axis. 
The load follows the image of the tractrix. 


14. Another way to parametrize the tractrix path given in 
Exercise 13 is 
y: (—00, 0) > R?, 


where y(r) = (e. | y 1— e? do) ; 
0 


(a) Show that y satisfies the property described in part 
(b) of Exercise 13. 


(b) In fact, y is actually a reparametrization of part of 
the path x of Exercise 13. Without proving this fact 
in detail, indicate what portion of the image of x 
the image of y covers. 


15. Suppose that a plane curve is given in polar coordi- 
nates by the equation r = f(@), where f is a function 


of class C?. Use equation (17) in §3.2 to derive the 
curvature formula 
Ir? — rr” + 2r?| 

(r? + r2)3/2 


K(0) = 


(Hint: First give parametric equations for the curve in 
Cartesian coordinates using 0 as the parameter.) 


16. Use the result of Exercise 15 to calculate the curvature 
of the lemniscate r? = cos 20. 


Let x: I + R? be a path of class C? that is not a straight line 
and such that x'(t) # 0. Choose some to € I and let 


y(t) = x(t) — s(t) T(r), 


where s(t) = i ||x’(t)|| dt is the arclength function and T is 


the unit tangent vector. The path y: I > R? is called the invo- 
lute of x. Exercises 17—19 concern involutes of paths. 


17. (a) Calculate the involute of the circular path of radius 
a, that is, x(t) = (a cost, a sin t). (Take tọ to be 0.) 

Q (b) Let a = | and use a computer to graph the path x 
and the involute path y on the same set of axes. 


18. Show that the unit tangent vector to the involute at t 
is the opposite of the unit normal vector N(t) to the 
original path x. (Hint: Use the Frenet—Serret formulas 
and the fact that a plane curve has torsion equal to zero 
everywhere.) 


19. Show that the involute y of the path x is formed by 
unwinding a taut string that has been wrapped around 
x as follows: 
(a) Show that the distance in R? between a point x(t) 
on the original path and the corresponding point 
y(t) on the involute is equal to the distance traveled 
from x(to) to x(t) along the underlying curve of x. 


(b) Show that the distance between a point x(t) on the 
path and the corresponding point y(t) on the in- 
volute is equal to the distance from x(t) to y(t) 
measured along the tangent emanating from x(t). 
Then finish the argument. 


Let x: I + R? be a path of class C? that is not a straight line 
and such that x'(t) # 0. Let 


e(t) = x(t) + “NO. 


This is the path traced by the center of the osculating circle of 
the path x. The quantity p = 1/k is the radius of the osculat- 
ing circle and is called the radius of curvature of the path x. 
The path e is called the evolute of the path x. Exercises 20-25 
involve evolutes of paths. 


20. Letx(t) = (t, t°) bea parabolic path. (See Figure 3.50.) 


(a) Find the unit tangent vector T, the unit normal 
vector N, and the curvature « as functions of t. 


(b) Calculate the evolute of x. 


9 (c) Use a computer to plot x(t) and e(t) on the same 


21. 


Q 22. 


Q 23. 
@ 24. 


25. 
26. 


27. 


set of axes. 


Osculating 
circle 


Parabola 


Figure 3.50 The parabola and its 
osculating circle at a point. The centers 
of the osculating circles at all points of 
the parabola trace the evolute of the 
parabola as described in Exercise 20. 


Show that the evolute of a circular path is a point. 


(a) Use a computer algebra system to calculate the for- 
mula for the evolute of the elliptical path x(t) = 
(a cost, b sint). 


(b) Use a computer to plot x(t) and the evolute e(t) on 
the same set of axes for various values of the con- 
stants a and b. What happens to the evolute when 
a becomes close in value to b? 


Use a computer algebra system to calculate the formula 
for the evolute of the cycloid x(t) = (at — a sint, a — 
a cost). What do you find? 


Use a computer algebra system to calculate the formula 
for the evolute of the cardioid x(t) = (2a cost(1 + 
a cost), 2a sint(1 + a cost)). 


Assuming K'(t) 4 0, show that the unit tangent vector 
to the evolute e(t) is parallel to the unit normal vector 
N(f) to the original path x(f). 


Suppose that a C! path x(t) is such that both its veloc- 
ity and acceleration are unit vectors for all t. Show that 
k = l forall t. 


Consider the plane curve parametrized by 


S 


d cos g(t) dt, yoy= f sin g(t) dt, 
0 0 


where g is a differentiable function. 


(a) Show that the parameter s is the arclength param- 
eter. 


(b) Calculate the curvature k(s). 
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(c) Use part (b) to explain how you can create a 
parametrized plane curve with any specified con- 
tinuous, nonnegative curvature function «(s). 


(d) Give a set of parametric equations for a curve 
whose curvature «(s) = |s|. (Your answer should 
involve integrals.) 


P (e) Use a computer to graph the curve you found in 
part (d), known as a clothoid or a spiral of Cornu. 
(Note: The integrals involved are known as Fres- 
nel integrals and arise in the study of optics. You 
must evaluate these integrals numerically in order 
to graph the curve.) 


28. Suppose that x is a C? path in R? with torsion t always 
equal to 0. 
(a) Explain why x must have a constant binormal vec- 
tor (i.e., one whose direction must remain fixed for 
all £). 


(b) Suppose we have chosen coordinates so that x(0) = 
0 and that v(0) and a(0) lie in the xy-plane (i.e., 
have no k-component). Then what must the binor- 
mal vector B be? 


(c) Using the coordinate assumptions in part (b), show 
that x(t) must lie in the xy-plane for all t. (Hint: 
Begin by explaining why v(t) -k = a(t) + k = 0 for 
all t. Then show that if 


x(t) = x(t)i + y(t)j + z(t)k, 


we must have z(t) = 0 for all t.) 


(d) Now explain how we may conclude that curves 
with zero torsion must lie in a plane. 


29. Suppose that x is a C? path in R°, parametrized by arc- 
length, with x Æ 0. Suppose that the image of x lies in 
the xy-plane. 

(a) Explain why x must have a constant binormal 

vector. 

(b) Show that the torsion t must always be zero. 
Note that there is really nothing special about the im- 
age of x lying in the xy-plane, so that this exercise, 
combined with the results of Exercise 28, shows that 
the image of x is a plane curve if and only if t is always 
zero and if and only if B is a constant vector. 


30. In Example 7 of §3.2 we saw that if x is a straight-line 
path, then x has zero curvature. Demonstrate the con- 
verse; that is, if x is a C? path parametrized by arc- 
length s and has zero curvature for all s, then x traces 
a straight line. 


31. A large piece of cylindrical metal pipe is to be manu- 
factured to include a strake, which is a spiraling strip 
of metal that offers structural support for the pipe. (See 
Figure 3.51.) The pieces of the strake are to be made 
from flat pieces of flexible metal whose curved sides 
are arcs of circles as shown in Figure 3.52. Assume that 
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the pipe has a radius of a ft and that the strake makes 
one complete revolution around the pipe every A ft. 


| 


Figure 3.51 A cylindrical 
pipe with strake attached. 


Figure 3.52 A section of 
the strake. (See Exercise 31.) 


(a) In terms of a and h, what should the inner radius r 
be so that the strake will fit snugly against the pipe? 


(b) Suppose a = 3 ft and h = 25 ft. What is r? 


Suppose that x: I —> R? is a path of class C? parametrized by 
arclength. Then the unit tangent vector T(s) defines a vector- 
valued function T: I —> R? that may also be considered to be a 
path (although not necessarily one parametrized by arclength, 
nor necessarily one with nonvanishing velocity). Since T is a 
unit vector, the image of the path T must lie on a sphere of 
radius | centered at the origin. This image curve is called the 
tangent spherical image of x. Likewise, we may consider the 
functions defined by the normal and binormal vectors N and B 
to give paths called, respectively, the normal spherical image 
and binormal spherical image of x. Exercises 32—35 concern 
these notions. 


32. Find the tangent spherical image, normal spherical 
image, and binormal spherical image of the circular 
helix x(t) = (a cost, a sint, bt). (Note: The path x is 
not parametrized by arclength.) 


33. Suppose that x is parametrized by arclength. Show 
that x is a straight-line path if and only if its tangent 
spherical image is a constant path. (See Example 7 of 
§3.2 and Exercise 30.) 


34. Suppose that x is parametrized by arclength. Show that 
the image of x lies in a plane if and only if its binormal 
spherical image is constant. (See Exercises 28 and 29.) 


35. Suppose that x is parametrized by arclength. Show 
that the normal spherical image of x can never be 
constant. 


36. In this problem, we will find expressions for velocity 
and acceleration in cylindrical coordinates. We begin 


37. 


38. 


39. 


with the expression 
x(t) = x(Hi + y(Oj + zk 


for the path in Cartesian coordinates. 


(a) Recall that the standard basis vectors for cylindri- 
cal coordinates are 


e, = cosĝ i + sing j, 
e = — sin ĝ i + cos j, 
e& =k. 


Use the facts that x = r cos and y = r sin to 
show that we may write x(t) as 


x(t) = r(t)e, + z(t) e. 


(b) Use the definitions of e,, eg, and e, just given and 
the chain rule to find de, /dt, deg /dt, and de,/dt 
in terms of e,, eg, and e,. 


(c) Now use the product rule to give expressions for v 
and a in terms of the standard basis for cylindrical 
coordinates. 


Suppose that the path 
x(t) = (sin 2t, V2 cos 2t, sin 2t — 2) 


describes the position of the Starship Inertia at time t. 


(a) Lt. Commander Agnes notices that the ship is trac- 
ing a closed loop. What is the length of this loop? 


(b) Ensign Egbert reports that the Inertia’s path is 
actually a flow line of the Martian vector field 
F(x, y, z) = yi — 2xj + yk, but he omitted a con- 
stant factor when he entered this information in 
his log. Help him set things right by finding the 
correct vector field. 


Suppose that the temperature at points inside a room is 

given by a differentiable function T(x, y, z). Livinia, 

the housefly (who is recovering from a head cold), is in 

the room and desires to warm up as rapidly as possible. 

(a) Show that Livinia’s path x(t) must be a flow line 
of kVT, where k is a positive constant. 

(b) If T(x, y, z) = x? — 2y? + 32? and Livinia is ini- 
tially at the point (2,3,—1), describe her path 
explicitly. 


Let F = u(x, y)i— v(x, y)j be an incompressible, 

irrotational vector field of class C°. 

(a) Show that the functions u and v (which deter- 
mine the component functions of F) satisfy the 
Cauchy—Riemann equations 

ðu ðv ðu dv 


— = >, an =-—. 
Ox dy 


3 See F. Morgan, Riemannian Geometry: A Beginners Guide, 2nd ed. (A K Peters, Wellesley, 1998), 
pp. 7-10. Figures 3.51 and 3.52 adapted with permission. 


40. 


41. 


(b) Show that u and v are harmonic, that is, that 


u dru 3v 3v 
—+-——=0 d —~+-—~=0. 
ax? t dy? ae axe T dy? 


Suppose that a particle of mass m travels along a path 
x according to Newton’s second law F = ma, where 
F is a gradient vector field. If the particle is also con- 
strained to lie on an equipotential surface of F, show 
that then it must have constant speed. 


Let a particle of mass m travel along a differentiable 
path x in a Newtonian vector field F (i.e., one that 
satisfies Newton’s second law F = ma, where a is the 
acceleration of x). We define the angular momen- 
tum I(t) of the particle to be the cross product of the 
position vector and the linear momentum my, that is, 


I(t) = x(t) X mv(t). 
(Here v denotes the velocity of x.) The torque about 
the origin of the coordinate system due to the force F 


is the cross product of position and force: 


M(t) = x(t) x F(t) = x(t) xX ma(t). 


42. 


43. 


44. 
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(Also see §1.4 concerning the notion of torque.) Show 
that 


dl 
—_—=M 
dt 
Thus, we see that the rate of change of angular mo- 


mentum is equal to the torque imparted to the particle 
by the vector field F. 


Consider the situation in Exercise 41 and suppose that 
F is a central force (i.e., a force that always points 
directly toward or away from the origin). Show that in 
this case the angular momentum is conserved, that is, 
that it must remain constant. 


Can the vector field 
F = (e* cosy + e™ sinz)i — e” sin yj +e * coszk 


be the gradient of a function f(x, y, z) of class C?? 
Why or why not? 


Can the vector field 
F =x + i+ Qe — e)j+ x k 


be the curl of another vector field G(x, y, z) of class 
C?? Why or why not? 
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Maxima and Minima 
in Several Variables 


4.1 Differentials and Taylor’s Theorem 


Among all classes of functions of one or several variables, polynomials are without 
a doubt the nicest in that they are continuous and differentiable everywhere and 
display intricate and interesting behavior. Our goal in this section is to provide 
a means of approximating any scalar-valued function by a polynomial of given 
degree, known as the Taylor polynomial. Because of the relative ease with which 
one can calculate with them, Taylor polynomials are useful for work in computer 
graphics and computer-aided design, to name just two areas. 


Taylor’s Theorem in One Variable: A Review - 


Suppose you have a function f: X C R —> R that is differentiable at a point a in 
X. Then the equation for the tangent line gives the best linear approximation for 
f near a. That is, when we define pı by 


pi(x)= f(a)+ f (a(x —a), wehave pi(x) © f(x) ifx ~a. 
(See Figure 4.1.) As explained in §2.3, the phrase “best linear approximation” 
means that if we take R(x, a) to be f(x) — pi (x), then 


R 
im AA g 


x>a x—a 
Note that, in particular, we have pı(a) = f(a) and p (a) = f'(a). 

Generally, tangent lines approximate graphs of functions only over very small 
neighborhoods containing the point of tangency. For a better approximation, we 
might try to fit a parabola that hugs the function’s graph more closely as in 
Figure 4.2. In this case, we want pz to be the quadratic function such that 


pa) = f(a), pj(a)= f'(a), and p3(a) = f"(a). 
The only quadratic polynomial that satisfies these three conditions is 


_f'@) 
© 2 


p2(x) = fla) + FO — a) (x —a). 


It can be proved that, if f is of class C?, then 
f(x) = pox) + R(x, a), 


y=-3x°+2x-3 


Figure 4.3 Approximations to 


f(x) = Inx. 
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y y 

y =pi(x) y =pi(x) y = p(x) 

y=f(x) y=f(x) 

= H x 
a a 


Figure 4.1 The graph of y = f(x) Figure 4.2 The tangent graphs of 


and its tangent line y = pı(x) at f, pi, and p2. 
x =a. 
where 
. R(x,a 
lim Rais) ) = 0. 


x>a (x — a} 
EXAMPLE 1 If f(x) = lnx, then, for a = 1, we have 
fd) =In1l =0, 
; 1 
f(=7=1, 


1 
iY) = age 
Hence, 
nx)=04+1@-)=x*-1, 
pox) =0+ 1-1) - 40 


The approximating polynomials pı and p2 are shown in Figure 4. 


1} = jx? H 2x 3. 
3. + 

There is no reason to stop with quadratic polynomials. Suppose we want to 
approximate f by a polynomial p of degree k, where k is a positive integer. 
Analogous to the work above, we require that p and its first k derivatives agree 
with f and its first k derivatives at the point a. Thus, we demand that 


pla) = f(a), 
p,a) = f'(a), 
p:a) = f(a), 


k 
Py (a) = fC). 
Given these requirements, we have only one choice for px, stated in the following 


theorem: 


THEOREM 1.1 (TAYLOR’S THEOREM IN ONE VARIABLE) Let X be open in 
R and suppose f:X CR— R is differentiable up to (at least) order k. 
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Figure 4.4 The graphs of 


(1) y=x-—a, 
(2) y = (x — a}, and 
B) y =- a’. 


Note how much more closely the 
graph of (3) hugs the x-axis than 
that of (1) or (2). 


Givena € X, let 


pals) = fla) + oa- aa E Oaa O 


Then 
fŒ) = p(x) + R(x, a), 


where the remainder term R; is such that R(x, a)/(x — a)‘ > 0 as x > a. 


The polynomial defined by formula (1) is called the kth-order Taylor poly- 
nomial of f at a. The essence of Taylor’s theorem is this: For x near a, the Taylor 
polynomial p approximates f in the sense that the error R; involved in making 
this approximation tends to zero even faster than (x — a)* does. When k is large, 
this is very fast indeed, as we see graphically in Figure 4.4. 


EXAMPLE 2 Consider Inx with a = 1 again. We calculate 


f() =In1 =0, 
iis e] 
ü= 
W 1 
f a= sae 
pkl p 
fay = £ D! He. 
Therefore, 
_4)k-1 
pale) =O - 1-5-4 pe —P-- + Gy 


Taylor’s theorem as stated in Theorem 1.1 says nothing explicit about the 
remainder term R;. However, it is possible to establish the following derivative 
form for the remainder: 


PROPOSITION 1.2 If f is of class C*+!, then there exists some number z be- 
tween a and x such that 


k+1) 
R(x, a) = LO" =a 


(k+1)! @) 


In practice, formula (2) is quite useful for estimating the error involved with a 
Taylor polynomial approximation. Both Theorem 1.1 (under the slightly stronger 
hypothesis that f is of class C‘*') and Proposition 1.2 are proved in the addendum 
to this section. 


EXAMPLE 3 The fifth-order Taylor polynomial of f(x) = cos x about x = 2/2 


© mozhe- a-e 
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(You should verify this calculation.) According to formula (2), the difference 
between ps and cos x is 


a f(z) 1 \ 6 COS Z 7 \ $6 
Bs(x, 5) =“ (#-5) =r (&-5) - 

2 6! 2 6! 2 
where z is some number between x /2 and x. Since | cos x| is never larger than 1, 
we have 

6 
las (x, =)| = COS Z (x ) 
2 6! 2 

Thus, for x in the interval [0, x], we have 


m (x — 1 /2)° me 
[Rs (x, ; )| < ~ 0.0209. 


Pia (2) 
= 720 


720 46,080 
In other words, the use of the polynomial ps above in place of cosx will be 
accurate to at least 0.0209 throughout the interval [0, zr]. + 


Taylor’s Theorem in Several Variables: 
The First-order Formula 


For the moment, suppose that f: X C R? — R is a function of two variables, 
where X is open in R° and of class C!. Then near the point (a, b) € X, the best 
linear approximation to f is provided by the equation giving the tangent plane at 
(a, b, f(a, b)). That is, 


f(x,y) © pil, y), 
where 


Pix, y) = f(a, b)+ frla, bx — a) + fy(a, b) — b). 
Note that the linear polynomial p; has the property that 


pila, b) = f(a, b); 


dpi _ of 
oe a 
dpi _ f 
m b) — ay b). 


Such an approximation is shown in Figure 4.5. 
To generalize this situation to the case of a function f: X C R” > R of 
class C!, we naturally use the equation for the tangent hyperplane. That is, if 


x 


Figure 4.5 The graph of z = f(x, y) and 
z = pi(x, y). 
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a = (a1, d2,...,d,) E€ X, then 
F (21, X2; -+3 Xn) © Pi, X2, --- Xn), 
where 
Pi&i, -5 Xn) = f(a) + fr (@O1 — a1) + fr. (@)(X2 — a2) 
+e + fy, (an — an). 


Of course, the formula for pı can be written more compactly using either 
-notation or, better still, matrices: 


Pi, -- -> Xn) = f(a) + » Faai — ai) = f(a) + Df(ayx—a). (3) 


i=l 


EXAMPLE 4 Let f(x1, X2, X3, x4) = x) + 2x2 + 3x3 + 4x4 + x1 x2x3x4. Then 


of of 
z = l + x2x3x4, z = 2 + xX1X3X4, 
Ox, OX2 
a 
ae = 3 + x1X2x4, of = 44 x1X2x3. 
0x3 0x4 
Ata = 0 = (0, 0, 0, 0), we have 
a a a a 
oF @ = l; oF 0) =2, dÍ @) =3, oF @) = 4. 
Ox] 0x2 0x3 Ox4 


Thus, 
Pi(X1, x2, X3, X4) = 0 + L(x, — 0) + 2(x2 — 0) + 3(x3 — 0) + 4x4 — 0) 
=x, + 2x. + 3x3 + 4x4. 


Note that pı contains precisely the linear terms of the original function f. On the 
other hand, if a = (1, 2, 3, 4), then 


a) 0 

I 1,2,3,4) = 25, °F 19,3,4) = 14, 
Ox, 3x2 

a) ə 

°F 1,2,3,4) =11, °F 1,2,3,4) = 10, 
0x3 0x4 


so that, in this case, 


Pi(X1, X2, X3, X4) = 54 + 25(x1 — 1) + 14(x2 — 2) + 11 (x3 — 3) + 10(x4 — 4). 
+ 


The relevant theorem regarding the first-order Taylor polynomial is just a re- 
statement of the definition of differentiability. However, since we plan to consider 
higher-order Taylor polynomials, we state the theorem explicitly. 


THEOREM 1.3 (FIRST-ORDER TAYLOR’S FORMULA IN SEVERAL VARIABLES) Let 
X be open in R” and suppose that f: X C R” — R is differentiable at the point 
a in X. Let 


Pi(x) = f(a) + Df(a)(x — a). (4) 
Then 
f(x) = pi(x) + Ri(x, a), 


where R(x, a)/||x — al] —> 0 as x > a. 
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Note that we may also express the first-order Taylor polynomial using the gradient. 
In place of (4), we would have 


pix) = f(a) + V f(a): (x — a). 


Differentials 


Before we explore higher-order versions of Taylor’s theorem in several variables, 
we consider the linear (or first-order) approximation in further detail. 
Let h = x — a. Then formula (3) becomes 


af 
OX; 


pi(x) = f(a) + Dfa) = f(a)+ $` (ahi. (5) 
i=l 


We focus on the sum appearing in formula (5) and summarize its salient 
features as follows: 


DEFINITION 1.4 Let f : X C R” — R and let a € X. The incremental 
change of f, denoted Af, is 


Af = f@ +h) - f(a). 
The total differential of f, denoted df (a, h), is 


ð 0 0 
df (a, h) = oe a + U ah +-+ H i 
Ox] 0x2 Onn 
The significance of the differential is that for h ~ 0, 
Af ®& df. 


(We have abbreviated df (a, h) by df.) 


Sometimes h; is replaced by the expression Ax; or dx; to emphasize that it 
represents a change in the ith independent variable, in which case we write 


af of 


af 
df = dx, + dx: +4 dXp. 
f Ox, "i 3x2 az Xn i 


(We’ve suppressed the evaluation of the partial derivatives at a, as is customary.) 


EXAMPLE 5 Suppose f(x, y, z) = sin(vyz) + cos(xyz). Then 


= yz[cos(xyz) — sin(xyz)|dx + xz[cos(xyz) — sin(xyz)]dy 
+ xy[cos(xyz) — sin(xyz)]dz 
= (cos(xyz) — sin(xyz))(yzdx + xz dy + xy dz). é 


The geometry of the differential arises, naturally enough, from tangent lines 
and planes. (See Figures 4.6 and 4.7.) In particular, the incremental change Af 
measures the change in the height of the graph of f when moving from a to a + h; 
the differential change df measures the corresponding change in the height of 
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y 


Figure 4.6 The incremental Figure 4.7 The incremental 
change Af equals the change in change Af equals the change in 
y-coordinate of the graph of z-coordinate of the graph of 

y = f(x) as the x-coordinate of a z= f(x, y) asa point in R? 

point changes from a toa + dx. changes from a = (a, b) to 

The differential df equals the a+h=(a+h,b+k). The 
change in y-coordinate of the differential df equals the change in 
graph of the tangent line at a (i.e., z-coordinate of the graph of the 
the graph of y = pj(x)). tangent plane at (a, b). 


the graph of the (hyper)plane tangent to the graph at a. When ||h|| is small (i.e., 
when a + his close to a), the differential df approximates the increment Af and 
it is often easier from a technical standpoint to work with the differential. 


EXAMPLE 6 Let f(x, y)=x—y+2x?+ xy?. Then for (a, b) = (2, —1), 
we have that the increment is 


Af = f2 + Ax, —1 + Ay) — 70-1) 
=2 + Ax — (—1 + Ay) + 2(2 + Axy + (2 + Ax)(—1 + Ayy — 13 
= 10Ax — 5Ay + (Ax? — 2Ax Ay + (Ay? + Ax(Ay). 


On the other hand, 


df ((2, -1), (Ax, Ay) = fe(2, —DAx + fy(2, —DAy 
= (14+ 4x + yla—-nAx + (-1 ++ 2xy)le-nAy 
= 10Ax — 5Ay. 


We see that df consists of exactly the terms of Af that are linear in Ax and Ay 
(i.e., appear to first power only). This will always be the case, of course, since 
that is the nature of the first-order Taylor approximation. Use of the differential 
approximation is often sufficient in practice, for when Ax and Ay are small, higher 
powers of them will be small enough to make virtually negligible contributions 
to Af. For example, if Ax and Ay are both 0.01, then 


df = (0.1 — 0.05) = 0.05 
and 


Af = (0.1 — 0.05) + 0.0002 — 0.0002 + 0.0002 + 0.000001 
= 0.05 + 0.000201 = 0.050201. 


Thus, the values of df and Af are the same to three decimal places. + 


Figure 4.8 Which would 
you buy? 
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EXAMPLE 7 A wooden rectangular block is to be manufactured with dimen- 
sions 3 in x 4 in x 6 in. Suppose that the possible errors in measuring each di- 
mension ofthe block are the same. We use differentials to estimate how accurately 
we must measure the dimensions so that the resulting calculated error in volume 
is no more than 0.1 in’. 

Let the dimensions of the block be denoted by x (~ 3 in), y (~ 4 in), and z 
(~ 6 in). Then the volume of the block is 


V=xyz and V¥3-4-6=72in’. 


The error in calculated volume is AV, which is approximated by the total differ- 
ential dV. Thus, 


AV ~ dV = V,(3, 4, 6)Ax + Vy(3, 4, 6)Ay + V:(3, 4, Az 
= 24Ax + 18Ay + 12Az. 


If the error in measuring each dimension is €, then we have Ax = Ay = Az = €. 
Therefore, 


dV = 24Ax + 18Ay + 12Az = 24e + 18e + 12e = 54e. 


To ensure (approximately) that |AV| < 0.1, we demand 
|dV| = |54e| < 0.1. 


Hence, 
0.1 . 
le] < — = 0.0019 in. 
54 
So the measurements in each dimension must be accurate to within 0.0019 in. @ 


EXAMPLE 8 The formula for the volume of a cylinder of radius r and height 
h is V(r, h) = mr7h. If the dimensions are changed by small amounts Ar and 
Ah, then the resulting change AV in volume is approximated by the differential 
change dV. That is, 


aV aV 
AV xdV = cs Ar+ ak Ah = 2nrhAr + rr Ah. 
r 


Suppose the cylinder is actually a beer can, so that it has approximate dimensions 
ofr = 1 inandh = 5 in. Then 


dV =x(10Ar + Ah). 


This statement shows that, for these particular values of r and h, the volume 
is approximately 10 times more sensitive to changes in radius than changes in 
height. That is, if the radius is changed by an amount e€, then the height must be 
changed by roughly 10e to keep the volume constant (i.e., to make AV zero). 
We use the word “approximate” because our analysis arises from considering the 
differential change dV rather than the actual incremental change AV. 

This beer can example has real application to product marketing strategies. 
Because the volume is so much more sensitive to changes in radius than height, 
it is possible to make a can appear to be larger than standard by decreasing its 
radius slightly (little enough so as to be hardly noticeable) and increasing the 
height so no change in volume results. (See Figure 4.8.) This sensitivity analysis 
shows that even a tiny decrease in radius can force an appreciable compensating 
increase in height. The result can be quite striking, and these ideas apparently 
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have been adopted by at least one brewery. Indeed, this is how the author came to 
fully appreciate differentials and sensitivity analysis.! + 


Taylor’s Theorem in Several Variables: 
The Second-order Formula 


Suppose f: X C R? > R is a C? function of two variables. Then we know that 
the tangent plane gives rise to a linear approximation pı of f near a given point 
(a, b) of X. We can improve on this result by looking for the guadric surface that 
best approximates the graph of z = f(x, y) near (a, b, f(a, b)). See Figure 4.9 
for an illustration. That is, we search for a degree 2 polynomial p2(x, y) = Ax? + 
Bxy + Cy? + Dx + Ey + F such that, for (x, y) ~ (a, b), 


f(x, y) © p(x, y). 


Quadric 
surface 


y y 

(a, b, f(a, b)) 
Tangent 
plane 


z =f(x, y) 


x 


Figure 4.9 The tangent plane and quadric 
surface. 


Analogous to the linear approximation p4, it is reasonable to require that p2 and 
all of its first- and second-order partial derivatives agree with those of f at the 
point (a, b). That is, we demand 


p2(a, b) = f(a, b), 


f) 0 f) 0 
22 (a,b) = L (a,b), Le ee) 
Ox Ox dy dy (6) 
ə? po a f ə? po of 
Ao a , b = a ’ b 9 ’ b = Sao ’ b ’ 
ax? (a, 5) ax? ae) axdy fash) ðxðy (a,b) 
0° p2 a f 
——(a, b) = — (a,b 
py? 0b) = 7b) 


After some algebra, we see that the only second-degree polynomial meeting these 
requirements is 


pox, y) = fla, b)+ fela, bx — a) + fila, b\(y — b) 
+} fex(a, b\(x — a) + fry(a, b\(x — a)(y — b) 
+ } fyy(a, Dy — bY. (7) 


l See S. J. Colley, The College Mathematics Journal, 25 (1994), no. 3, 226-227. Art reproduced with 
permission from the Mathematical Association of America. 
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How does formula (7) generalize to functions ofn variables? We need to begin 
by demanding conditions analogous to those in (6) fora function f: X C R” > R. 


For a = (a1, a2, ..., an) € X, these conditions are 
p2(a) = f(a), 
Pa) = a), $219 sccm (8) 
a pe (a) = aii W I= Lust 


OXj OX; OXj OX; 


If you do some algebra (which we omit), you will find that the only polynomial 
of degree 2 that satisfies the conditions in (8) is 


n 1 n 
PX) = SDH) KAH D fD aaja). 9) 
i=l i j=l 
(Note that the second sum appearing in (9) is a double sum consisting ofn? terms.) 


To check that everything is consistent when n = 2, we have 


P2(X1, x2) = f(a, a2) + fx (Qi, a2)(x1 — a1) + fx (a1, A2)(X2 — a2) 
+ 4 [fers (@1, aŒ — a1)? + frolar, a) — a1)(%2 — a2) 


T frx (a, a)(x2 a a)(xı E ai) F Jox (a, az)(x2 a a) | . 


When f is a C? function, the two mixed partials are the same, so this formula 
agrees with formula (7). 


EXAMPLE 9 Let f(x, y, z) = e*** and let a = (a, b, c) = (0, 0, 0). Then 
f(0,0, 0) =e =1, 
f.0,0, = 7,0,0,0) = 7,0,0,0) =e? =1, 
Fixx (0, 0, 0) = Fry, 0, 0) = fxz(0, 0, 0) = fry, 0, 0) 
= 70,0, 0) = 740.0, 022" = 1, 


Thus, 


p(x, y,z)=1+ I(x —0)4+ 1(y — 0) + 1(z — 0) 

5 [1@x — 0)? +2: 1(x — Oy — 0) + 2+ 1x — 0(z — 0) 
+ 1y — 0) +2: 1(y — Oz — 0) + 1(z — 0)" 

=1 +x +y +z+ 5x7 +uytuzt ty? +yz+ 52° 


=l+(xtytz)t+ia+y+zy. 
We have made use of the fact that, since f is of class C?, a term like 


fry(0, 0, 0) — O)(y — 0) isequalto fyx(0,0, 0) -0 -0). 4 


Now we state the second-order version of Taylor’s theorem precisely. 
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THEOREM 1.5 (SECOND-ORDER TAYLOR’S FORMULA) Let X be open in R”, and 
suppose that f: X C R” —> R is of class C’. Let 


n 1 n 
px) = SDH) KA D, fe Ai — a); — ay). 
i=1 i,j=l 
Then 
F(%) = po(x) + R(x, a), 


where | R2|/||x — all? > 0 as x > a. 


A version of Theorem 1.5, under the stronger assumption that f is of class C?, is 
established in the addendum to this section. 
EXAMPLE 10 Let f(x, y) = cos x cos y and (a, b) = (0, 0). Then 
f(0, 0) = 1; 
f<(0, 0) = — sin x cos y|, = 0, f,(0, 0) = — cos x sin y|0,9 = 0; 
fxx(0, 0) = — cos x cos yl, = —1, 
fry(0, 0) = sin x sin y|o,0) = 0, 
fyy(0, 0) = — cos x cos yl, = —1. 
Hence, 
f(x,y) © pax, y= 1+ 3(-1-x? — 1+’) = 1 a. 


We can also solve this problem another way since f is a product of two functions. 
We can multiply the two Taylor polynomials: 


P2(x, y) = (Taylor polynomial for cos x) - (Taylor polynomial for cos y) 


= (1-H) 0-9) 


=1— 5x — 3Y up to terms of degree 2. 


This method is justified by noting that if q2 is the Taylor polynomial for cosine 
and R3 is the corresponding remainder term, then 


cosx cos y = [q2(x) + Ra(x, 0)][q2(y) + Ra(y, 0)] 
= q2(x)qa(y) + q2(y)Ra(x, 0) + q2(x)R2(y, 0) + Ro(x, 0)R2(y, 0) 
= qo(x)g2(y) + other stuff, 


where (other stuff)/||(x, y)||>7 —> 0 as (x, y) > (0,0), since both R2(x, 0) and 
Ro(y, 0) do. + 


The Hessian 
Recall that the formula for the first-order Taylor polynomial pı was written quite 
concisely in formula (5) by using vector and matrix notation. It turns out that it 
is possible to do something similar for the second-order polynomial p2. 
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DEFINITION 1.6 The Hessian ofa function f: X C R” — Ris the matrix 
whose ijth entry is 0? f/d.x;0x;. That is, 


trix trix sia Srixy 


= Fran Fran i Fram 


Fan: Frnxr ine a 


The term “Hessian” comes from Ludwig Otto Hesse, the mathematician who 
first introduced it, not from the German mercenaries who fought in the American 
revolution. 

Now let’s look again at the formula for pz in Theorem 1.5: 


n 1 n 
PAX) = SAY hi5 D7 fr hih. 
i=l ij=l 
(We have let h = (h1, ..., An) = x — a.) This can be written as 


hz 


B 
pos) = fla)+| fala) AORE] 

| hn | 

frx (a) faala) = Jaxa) hı 

fraxi (8a) frx (8) nae Jor (A) hy 

hy | : : a : : 


frx a) fraza (8) = Jeax, (a) hn 


Thus, we see that 


p2(x) = f(a) + Df(a)h + zh” H f (a)h. 


(Remember that h” is the transpose of the n x 1 matrix h.) 


EXAMPLE 11 (Example 10 revisited) For f(x, y) = cos x cos y, a = (0, 0), 
we have 


Df (x,y) = |- sinx cosy — cosx siny] 
and 


are. =| 


— cosx cosy sinx sin y 
sinx siny —cosxcosy |’ 
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Hence, 
p(x, y) = f(0, 0) + Df (0, 0)h + Sh’ Hf (0, O)h 


=14+[0 oJ a [tale Ilo EJA 


= 1— 4h} — ih. 
Once we recall thath = (h1, h2) = (x — 0, y — 0) = (x, y), we see that this result 
checks with our work in Example 10, just as it should. + 


Higher-order Taylor Polynomials 


So far we have said nothing about Taylor polynomials of degree greater than 2 
in the case of functions of several variables. The main reasons for this are (i) the 
general formula is quite complicated and has no compact matrix reformulation 
analogous to (10) and (ii) we will have little need for such formulas in this text. 
Nonetheless, if your curiosity cannot be denied, here is the third-order Taylor 
polynomial for a function f: X C R” > R of class C? near a € X: 


n 1 n 
psx) = SA+) ful @MOi— a1) D fiA- aa; — ay) 
i=l i,j=l 


n 


+= YS fogai- aj — a)r — a4). 


| 
3! i, j,k=1 


(The relevant theorem regarding p3 is that f(x) = p3(x) + R3(x, a), where 
|R3(x, a)|/||x — al]> —> 0 as x —> a.) If you must know even more, the kth-order 
Taylor polynomial is 


n 1 n 
PAX) = SDH) AA D fiA — aly aj 
i=1 i,j=1 


+ +4 2 E E E) 


Formulas for Remainder Terms (optional) 


Under slightly stricter hypotheses than those appearing in Theorems 1.3 and 
1.5, integral formulas for the remainder terms may be derived as follows. Set 
h = x — a. If f is of class C?, then 


R(x, a) = aah (l — t) frx, (a + th)h;h; dt 


i,j=l 


= [ [h’ Hf (a+ thyh] (1 — 1) dr. 
0 
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If f is of class C?, then 


- -t 
R(x, a) = fuxa + thyhjhjhy dt, 
E 2 1 


and if f is of class C**!, then 


= = y 
R, a) = Janza Xiggy (a + th)h;, hj, ay hina dt. 


pecs 2 1 


Although explicit, these formulas are not very useful in practice. By artful appli- 
cation of Taylor’s formula for a single variable, we can arrive at derivative versions 
of these remainder terms (known as Lagrange’s form of the remainder) that are 
similar to those in the one-variable case. 


Lagrange’s form of the remainder. If f is of class C?, then in Theorem 1.3 
the remainder R is 


Ri (x, a) = De fix; (Zhi ihj 


2 m 


for a suitable point z in the domain of f on the line segment joining a and 
x = a + h. Similarly, if f is of class C?, then the remainder Rz in Theorem 
1.5 is 
il n 
R(x, a) = z 2o Sexa hihih 


k=l 
for a suitable point z on the line segment joining a and x = a + h. More 
generally, if f is of class C*+', then the remainder Ry is 


1 


Ry (xX, a) = (k+)! > PES xa hiahia: -h ik+1 
figa 


sik}1=l 


for a suitable point z on the line segment joining a and x = a + h. 


The remainder formulas above are established in the addendum to this section. 


EXAMPLE 12 For f(x, y) = cos x cos y, we have 


2 
31 > frixjx(Zhihj he 


* |i pk=l 


1 2 
<, 21 ` [hih;hxl, 
j k=l 


since all partial derivatives of f will be a product of sines and cosines and, hence, 
no larger than 1 in magnitude. Expanding the sum, we get 


|Ra(x, y, 0, O| < t (lai + 3ht|hal + 3111h + 1h21) . 
If both |4;| and |h2| are no more than, say, 0.1, then 
|Ro(x, y, 0, 0)| < 4 (8 - (0.1)°) = 0.0013. 


1 
|Ro(x, y; 0, 0)| = Ss, 
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y 


0.1 


-0.1 0.1 


-0.1 


Figure 4.10 The polynomial pz 
approximates f to within 0.0013 
on the square shown. (See 
Example 12.) 


cos x COs y 


Figure 4.11 The graph of f(x, y) = 
cos x cos y and its Taylor polynomial 
po(x, y) = 1 — 4x” — Sy? over the square 


(œ, y) -1 <x<1,—-1 <y <1}. 


So throughout the square of side 0.2 centered at the origin and shown in Fig- 
ure 4.10, the second-order Taylor polynomial is accurate to at least 0.0013 (i.e., to 
two decimal places) as an approximation of f(x, y) = cos x cos y. In Figure 4.11, 
we show the graph of f(x, y) = cos x cos y over the square domain {(x, y) | —1 < 
x <1, —1 < y < 1} together with the graph of its second-order Taylor polyno- 
mial po(x, y) = 1 — $x? — Sy? (calculated in Example 10). Note how closely the 
surfaces coincide near the point (0, 0, 1), just as the analysis above indicates. ® 


Addendum: Proofs of Theorem 1.1, Proposition 1.2, 
and Theorem 1.5 


Below we establish some of the fundamental results used in this section. We begin 
by proving Theorem 1.1, Taylor’s theorem for function of a single variable, and 
Proposition 1.2 regarding the remainder term in Theorem 1.1. We then use these 
results to “bootstrap” a proof of the multivariable result of Theorem 1.5 and to 
derive Lagrange’s formula for the remainder term appearing in it. 


Proof of Theorem 1.1 We prove the result under the stronger assumption that f 
is of class C*+! rather than assuming that f is only differentiable up to order k. 
(This distinction matters little in practice.) 

By the fundamental theorem of calculus, 


f@)—f@= / fiat. (11) 


We evaluate the integral on the right side of (11) by means of integration by parts. 
Recall that the relevant formula is 


[rdv=w- f vau. 


We use this formula with u = f’(t) and v = x — t so that dv = —dt. (Note that 
in the right side of (11), x plays the role of a constant.) We obtain 


| roa=-roe-o}+ f e-0'Od 


=(@Oe=2) + / “OOF "Oat. (12) 
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Combining (11) and (12), we have 


fs) = fOr Oa f (x — 1) f"(t) dt. (13) 


Thus, we have shown, when f is differentiable up to (at least) second order, that 


Reas [e -OfO at. 


This provides an integral formula for the remainder in formula (1) of Theorem 1.1 
when k = 1, although we have not yet established that R(x, a)/(x — a) —> 0 as 
x>a. 

To obtain the second-order formula, the case k = 2 of (1), we focus on 
Ri(x, a) = ie (x — t)f’(t)dt and integrate by parts again, this time with u = 
f"(t) and v = (x — t} /2, so that dv = —(x — t) dt. We obtain 


x n — 4)\2 |* X _ 4/2 
[c-oro je TOE OY l aD? prar 


_ f'a -a? . Pen 
z 2 l | 2 


2 
f(t) dt. 
Hence (13) becomes 


fe) = f+ fax -a+ Ow af SS C=O prejar. 


Therefore, we have shown, when f is differentiable up to Be least) third order, 
that 


Rpa [S C- moat. 


We can continue to argue in this manner or use mathematical induction to show 
that formula (1) holds in general with 


R= | = t pD(0) de, (14) 


assuming that f is differentiable up to order (at least) k + 1. 

It remains to see that R(x, a)/(x — a) + 0 as x — a. In formula (14) we 
are only considering t between a and x, so that |x — t| < |x — a|. Moreover, since 
we are assuming that f is of class C*+!, we have that f+!)(r) is continuous and, 
therefore, bounded for t between a and x (i.e., that | f“*(t)| < M for some 


constant M). Thus, 
x —t k 
[S (x — m= Daar < 2| (x 7 ) 


where the plus sign M if x > a and the negative sign if x < a, 


|Rx(x, @)| < 


Pdi 


“M k M k+l 
zaj poal d= geal ; 


Thus, 


as x — a, as desired. E 
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Proof of Proposition 1.2 We establish Proposition 1.2 by means of a general 
version of the mean value theorem for integrals. This theorem states that for 
continuous functions g and h such that h does not change sign on [a, b] (i.e., 
either h(t) > 0 on [a, b] or h(t) < 0 on [a, b]), there is some number z between 
a and b such that 


b b 
/ g(t)h(t) dt = g(z) / node 


(We omit the proof but remark that this theorem is a consequence of the interme- 
diate value theorem.) Applying this result to formula (14) with g(t) = fP) 
and h(t) = (x — t)*/k!, we find that there must exist some z between a and x 


such that 
r ake L Ak N f= 
R(x, a) = mg] 2 = dt = 9 -F ) t=a 
FET) 
= gry O m 


Proof of Theorem 1.5 As inthe proof of Theorem 1.1, we establish Theorem 1.5 
under the stronger assumption that f is of class C?. Begin by setting h = x — a, 
so that x = a+ h, and consider a and h to be fixed. We define the one-variable 
function F by F(t) = f(a+ th). Since f is assumed to be of class C? on an 
open set X, if we take x sufficiently close to a, then F is of class C? on an open 
interval containing [0, 1]. Thus, Theorem 1.1 with k = 2, a = 0, and x = | may 
be applied to give 


F"(0) 
2! 


+ Ro(1, 0), (15) 


F(1) = F0) + FO — 0) + (1 — 0)’ + Ro(1, 0) 


F"(0) 
2 


= F(0)+ F'(0)4 


where R>(1, 0) = i ay F(t) dt. Now we use the chain rule to calculate deriva- 


tives of F in terms of partial derivatives of f: 


F(t) = Df(a + thyh = Y” fy (a + th)h;; 


i=l 


i=1 Lj= i,j=l 


F'(t) = a | frx, (a + mn | h= 2 Six; (a + th)h;h;; 
1 


n 


F(t) = id » Jaxa la + an hy — RRAC) + th)hih;hg. 


k=1 Li,j=1 i,j,k=1 


Thus, (15) becomes 


n 1 n 
Fath) = fla)+ D7 fai + 5 Do fax, Ohh 
i=l i,j=l 


+ 2; Sixx (a+ th)hjhjhy dt, 
0 


i,j,k=1 2 
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or, equivalently, 


n 1 n 
FO) = f+) fli — a+ 5 Dy farsa aa — ay) 
i=1 


i,j=l 
+ R(x, a), 
where the multivariable remainder is 
1 Lit 
Raa= > f faxa (a + th)hihjhy dt. (16) 
i,j.k=1 99 2 


We must still show that | R2(x, a)|/||x — all? —> 0 as x — a, or, equivalently, 
that | R2(x, a)|/|h|/? —> 0 as h — 0. To demonstrate this, note that, for a and h 
fixed, the expression (1 — t)? Jxix;x (a + th) is continuous for ¢ in [0, 1] (since f 
is assumed to be of class C*), hence bounded. In addition, for i = 1, ...,n, we 
have that |/;| < ||h||. Hence, 


a-r? 
|Ro(x, a)| =| 5° — fuxa la + thyhihjhy dt 
i,j,k=1 Y0 
E Jay 
DP 5 fuxa (a + th)hihjhy| dt 
i,j,k=1 79 
n 1 
z2 i Mh) dt = n° M|h|° = n° Mx — all. 
i,j.k=1 49 
Thus, 
R(x, a 
Maaa < n? M|ix— all > 0 
Ix — all 
asx > a. 
Finally, we remark that entirely similar arguments may be given to establish 
results for Taylor polynomials of orders higher than two. a 


Lagrange’s formula for the remainder (see page 257) Using the function 

F(t) = f(a+ th) defined in the proof of Theorem 1.5, Proposition 1.2 implies 

that there must be some number c between 0 and 1 such that the one-variable 

remainder is 

F” (c) 
3! 


R(1, 0) = a- 0ř. 


Now, the remainder term R2(1, 0) from Proposition 1.2 is precisely R2(x, a) in 
Theorem 1.5 and 


n n 


F" = Y` fanaat ch)hihh= X > farxjx@hibjhe, 


i, j,k=1 i, j,k=1 
where z = a + ch. Since c is between 0 and 1, the point z lies on the line segment 
joining a and x = a + h, and so 


n 


1 
Raad = 3D) fay hihihi 


`i j,k=l 


which is the result we desire. The derivation of the formula for R(x, a) fork > 2 
is analogous. m 
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Exercises 


In Exercises 1-7, find the Taylor polynomials p; of given order 
k at the indicated point a. 


1. f(x) =e*,a=0,k =4 

2. f(x) =n +x),a=0,k =3 
3. f(x)=1/x*?,a=1k =4 

4. f(x)=JSx,a=1,k =3 

5. f(x)=J/x,a=9,k =3 

6. f(x) =sinx,a=0,k =5 

7. f(x) =sinx,a=7/2,k =5 


In Exercises 8—15, find the first- and second-order Taylor poly- 
nomials for the given function f at the given point a. 


14. 
15. 


. f(x,y) = 1/(x? + y? + 1), a = (0, 0) 

. fey= 1G? +y +1), a= (1, —1) 

© f(x,y) =e*,a = (0, 0) 

. f(x, y) = e™ cos3y, a = (0, 7) 

. f(x, y, z) = ye™ + ze”, a = (0, 0, 2) 

. fŒ, y, 2) = xy — 3y? + 2xz, a = (2, —1, 1) 
fay z) = 1/(x? + y? +z? + 1), a = (0, 0, 0) 
f(x, y, Z) = sinxyz, a = (0, 0, 0) 


In Exercises 16-20, calculate the Hessian matrix H f(a) for 
the indicated function f at the indicated point a. 


16 
17 


18. 


19. 
20. 
21. 


22. 


23. 


. fœ, y) = 1/6? + y? + 1), a = (0, 0) 
- f(x, y) = cosx sin y, a = (7/4, 7/3) 


fœ, y,z)= Fay ta 12, —4) 


f(x, y, Z) = x? + x?y — yz? + 2z, a = (1, 0, 1) 
f(x, y, z) = e™7? sin 5z, a = (0, 0, 0) 


For f and a as given in Exercise 8, express the second- 
order Taylor polynomial p2(x, y), using the derivative 
matrix and the Hessian matrix as in formula (10) of 
this section. 


For f and a as given in Exercise 11, express the second- 
order Taylor polynomial p2(x, y), using the derivative 
matrix and the Hessian matrix as in formula (10) of 
this section. 


For f and aas given in Exercise 12, express the second- 
order Taylor polynomial p(x, y, z), using the deriva- 
tive matrix and the Hessian matrix as in formula (10) 
of this section. 


24. 


25. 


26. 


27. 


For f and a as given in Exercise 19, express the second- 
order Taylor polynomial p(x, y, z), using the deriva- 
tive matrix and the Hessian matrix as in formula (10) 
of this section. 


Consider the function 


f(%1, X2, oe 


(a) Calculate Df(0,0,...,0) and Hf(0,0,..., 0). 


(b) Determine the first- and second-order Taylor poly- 
nomials of f at 0. 


E Xn) — eX texan te +NnXn i 


(c) Use formulas (3) and (10) to write the Taylor poly- 
nomials in terms of the derivative and Hessian 
matrices. 


Find the third-order Taylor polynomial p3(x, y, z) of 
f (x, y, z) = et t2yt3z 


at (0, 0, 0). 
Find the third-order Taylor polynomial of 


fy, z) = xf + xy + 2y? — x2? + 2x7y 4 
(a) at (0, 0, 0). 


(b) at (1, —1, 0). 


Determine the total differential of the functions given in 
Exercises 28-32. 


28. 
29. 
30. 
31. 
32. 
33. 


34. 


35. 


fE, y) =y 
f(x, y, z) = x? + 3y? — 2z? 
f(x, y, z) = cos (xyz) 


f(x, y,z) = e* cosy +e? sinz 


f(x, y, 2) = 1/,/xyz 


Use the fact that the total differential df approximates 
the incremental change Af to provide estimates of the 
following quantities: 


(a) (7.07? 1.98} 

(b) 1//(4.1)(1.96)(2.05) 

(c) (1.1) cos ((a — 0.03)(0.12)) 

Near the point (1, —2, 1), is the function g(x, y, z) = 


x? — 2xy + x?z + 7z most sensitive to changes in x, 
y, or z? 


To which entry in the matrix is the value of the 
determinant 
3 
=I 5 


most sensitive? 


36. 


37. 


38. 


39. 


Figure 4.12 The graph of 
z= f(x, y). 


If you measure the radius of a cylinder to be 2 in, with 
a possible error of +0.1 in, and the height to be 3 in, 
with a possible error of +0.05 in, use differentials to 
determine the approximate error in 

(a) the calculated volume of the cylinder. 


(b) the calculated surface area. 


A can of mushrooms is currently manufactured to have 
a diameter of 5 cm and a height of 12 cm. The man- 
ufacturer plans to reduce the diameter by 0.5 cm. Use 
differentials to estimate how much the height of the 
can would need to be increased in order to keep the 
volume of the can the same. 


Consider a triangle with sides of lengths a and b that 

make an interior angle 0. 

(a) Ifa = 3,b = 4, and6@ = 77/3, to changes in which 
of these measurements is the area of the triangle 
most sensitive? 


(b) Ifthe length measurements in part (a) are in error 
by as much as 5% and the angle measurement is 
in error by as much as 2%, estimate the resulting 
maximum percentage error in calculated area. 


To estimate the volume of a cone of radius approx- 
imately 2 m and height approximately 6 m, how ac- 
curately should the radius and height be measured so 
that the error in the calculated volume estimate does 


40. 


41. 


42. 


43. 
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not exceed 0.2 m°? Assume that the possible errors in 
measuring the radius and height are the same. 


Suppose that you measure the dimensions of a block 
of tofu to be (approximately) 3 in by 4 in by 2 in. 
Assuming that the possible errors in each of your mea- 
surements are the same, about how accurate must your 
measurements be so that the error in the calculated 
volume of the tofu is not more than 0.2 in*? What per- 
centage error in volume does this represent? 


(a) Calculate the second-order Taylor polynomial for 
f(x, y) =cosx sin y at the point (0, 2/2). 

(b) If h = (hı, h2) = (x, y) — (0, 7/2) is such that 
|h,| and |h2| are no more than 0.3, estimate how 
accurate your Taylor approximation is. 


(a) Determine the second-order Taylor polynomial of 
f(x, y) = e*t” at the origin. 

(b) Estimate the accuracy of the approximation if |x| 
and |y| are no more than 0.1. 


(a) Determine the second-order Taylor polynomial of 
f(x, y) = e™ cos y at the point (0, 2/2). 

(b) If h = (hı, h2) = (x, y) — (0, 2/2) is such that 
|hi| < 0.2 and |h2| < 0.1, estimate the accuracy 
of the approximation to f given by your Taylor 
polynomial in part (a). 


4.2 Extrema of Functions 


The power of calculus resides at least in part in its role in helping to solve a wide 
variety of optimization problems. With any quantity that changes, it is natural to 
ask when, if ever, does that quantity reach its largest, its smallest, its fastest or 
slowest? You have already learned how to find maxima and minima of a function 
of a single variable, and no doubt you have applied your techniques to a number of 
situations. However, many phenomena are not appropriately modeled by functions 
of only one variable. Thus, there is a genuine need to adapt and extend optimization 
methods to the case of functions of more than one variable. We develop the 
necessary theory in this section and the next and explore a few applications in $4.4. 


Critical Points of Functions 


Let X be open in R” and f: X C R” — Ra scalar-valued function. 


Max. 


DEFINITION 2.1 We say that f has a local minimum at the point a in 
X if there is some neighborhood U of a such that f(x) > f(a) for all x 
in U. Similarly, we say that f has a local maximum at a if there is some 
neighborhood U of a such that f(x) < f(a) for all x in U. 


When n = 2, local extrema of f(x, y) are precisely the pits and peaks of the 
surface given by the graph of z = f(x, y), as suggested by Figure 4.12. 
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We emphasize our use of the adjective “local.” When a local maximum of 
a function f occurs at a point a, this means that the values of f at points near 
a can be no larger, not that all values of f are no larger. Indeed, f may have 
local maxima and no global (or absolute) maximum. Consider the graphs in 
Figure 4.13. (Of course, analogous comments apply to local and global minima.) 


Global 


maximum 


Local 
maximum 


No global 
maximum 


Figure 4.13 Examples of local and global maxima. 


Recall that, if a differentiable function of one variable has a local extremum 
at a point, then the derivative vanishes there (i.e., the tangent line to the graph 
of the function is horizontal). Figures 4.12 and 4.13 suggest strongly that, if a 
function of two variables has a local maximum or minimum at a point in the 
domain, then the tangent plane at the corresponding point of the graph must be 
horizontal. Such is indeed the case, as the following general result (plus formula 
(4) of §2.3) implies. 


THEOREM 2.2 Let X be open in R” and let f: X C R” — R be differentiable. 
If f has a local extremum at a € X, then Df(a) = 0. 


PROOF Suppose, for argument’s sake, that f has a local maximum at a. Then the 
one-variable function F defined by F(t) = f(a + th) must have a local maximum 
att = 0 for any h. (Geometrically, the function F is just the restriction of f to the 
line through a parallel to h as shown in Figure 4.14.) From one-variable calculus, 
we must therefore have F’(0) = 0. By the chain rule 


F'O) = £ f(a+th)] = Df(a + thh = V f(a + th) -h. 


Graph of f 
restricted to line 


Figure 4.14 The graph of f restricted to a line. 


Figure 4.15 The function f is 
strictly positive on the shaded 
region, strictly negative on the 
unshaded region, and zero along 
the lines y = +x. 
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Hence, 
0= F'(0) = Df(a)h = fa (ahy + fe (ahyz +e fe (aan. 


Since this last result must hold for all h € R”, we find that by setting h in turn 
equal to (1,0,..., 0), (0, 1,0,...,0),...,(0,...,0, 1), we have 


faa) = foa) = = fanla) = 0. 
Therefore, Df (a) = 0, as desired. E 


A point a in the domain of f where Df (a) is either zero or undefined is called 
a critical point of f. Theorem 2.2 says that any extremum of f must occur at a 
critical point. However, it is by no means the case that every critical point must 
be the site of an extremum. 


EXAMPLE 1 If f(x, y) =x? — y’, then Df(x, y) = [| 2x —2y ] so that, 
clearly, (0, 0) is the only critical point. However, neither a maximum nor a mini- 
mum occurs at (0, 0). Indeed, inside every open disk centered at (0, 0), no matter 
how small, there are points for which f(x, y) > f (0, 0) = 0 and also points where 
f(x,y) < f(0, 0). (See Figure 4.15.) + 


This type of critical point is called a saddle point. Its name derives from the 
fact that the graph ofz = f(x, y) looks somewhat like a saddle. (See Figure 4.16.) 


Figure 4.16 A saddle point. 


EXAMPLE 2 Let f(x, y) = \/x? + y?. The domain of f is all of R?. We com- 
2x 2y . 

32+ ype 3024 <a | note that Df is unde- 

fined at (0,0) and nonzero at all other (x, y) € R?. Hence, (0, 0) is the only 

critical point. Since f(x, y) > 0 for all (x, y) and has value 0 only at (0, 0), we 

see that f has a unique (global) minimum at (0, 0). + 


pute that Df(x, y) = | 


The Nature of a Critical Point: The Hessian Criterion ——— 


We illustrate our current understanding regarding extrema with the following 
example: 


EXAMPLE 3 We find the extrema of 
fœ, y) =x +xy +y +2x—2y+5. 
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f(a) 


Figure 4.17 An 
upward-opening parabola. 


f@+ 


Figure 4.18 A 
downward-opening parabola. 


Since f is a polynomial, it is differentiable everywhere, and Theorem 2.2 implies 
that any extremum must occur where df/dx and df/dy vanish simultaneously. 
Thus, we solve 


0 
ree 
Ox 
0 $ 
CENTE E ET 
dy 


and find that the only solution is x = —2, y = 2. Consequently, (—2, 2) is the 
only critical point of this function. 

To determine whether (—2, 2) is a maximum or minimum (or neither), we 
could try graphing the function and drawing what we hope would be an obvious 
conclusion. Of course, such a technique does not extend to functions of more than 
two variables, so a graphical method is of limited value at best. Instead, we’ll see 
how f changes as we move away from the critical point: 


Af = f(-2+h,2+k)— f(-2, 2) 
= [(-2+ hy + (—2 + hX(2 + k) + (2 + k? 
H2(—2 +h) —2(2+k)+5]-1 
=h +hk+k. 
If the quantity Af = h? + hk + k? is nonnegative for all small values of h and k, 
then (—2, 2) yields a local minimum. Similarly, if A f is always nonpositive, then 
(—2, 2) must yield a local maximum. Finally, if Af is positive for some values 


of h and k and negative for others, then (—2, 2) is a saddle point. To determine 
which possibility holds, we complete the square: 


Af =W+hk +R =h? + hk + iK + 3? = (h + dk) + 3k. 


Thus, Af > 0 for all values of h and k, so (—2, 2) necessarily yields a local 
minimum. + 


Example 3 with its attendant algebra clearly demonstrates the need for a better 
way of determining when a critical point yields a local maximum or minimum (or 
neither). In the case ofa twice differentiable function f: X C R > R, you already 
know a quick method, namely, consideration of the sign of the second derivative. 
This method derives from looking at the second-order Taylor polynomial of f 
near the critical point a, namely, 


FO) ® pala) = fla) + fac -a+ Ge ~ ay 
= fa) + Oe -ay, 


since f’ is zero at the critical point a of f. If f”(a) > 0, the graph of y = po(x) 
is an upward-opening parabola, as in Figure 4.17, whereas if f”(a) < 0, then 
the graph of y = p2(x) looks like the one shown in Figure 4.18. If f(a) = 0, 
then the graph of y = p2(x) is just a horizontal line, and we would need to use 
a higher-order Taylor polynomial to determine if f has an extremum at a. (You 
may recall that when f”(a) = 0, the second derivative test from single-variable 
calculus gives no information about the nature of the critical point a.) 

The concept is similar in the context of n variables. Suppose that 


fO) = f, X2,..., Xn) 
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is of class C? and that a = (a1, a2, . . . , an) is a critical point of f. Then the 
second-order Taylor approximation to f gives 


Af = f(x) — fla) © pax) — f(a) 
= Df(a)(x — a) + (x — a)" H f(a) — a) 


when x ~ a. (See Theorem 1.5 and formula (10) in §4.1.) Since f is of class C? 
and a is a critical point, all the partial derivatives vanish at a, so that we have 
Df (a) = 0 and, hence, 


Af ~ 3(x— a)" Hf (a) — a). (1) 


The approximation in (1) suggests that we may be able to see whether the in- 
crement Af remains positive (respectively, remains negative) for x near a and, 
hence, whether f has a local minimum (respectively, a local maximum) at a by 
seeing what happens to the right side. 

Note that the right side of (1), when expanded, is quadratic in the terms 
(x; — ai). More generally, a quadratic form in hı, h2, ..., fn is a function Q 
that can be written as 


Oh, hy, a) hy) = >» bijhihj, 
ij=l 


where the b;;’s are constants. The quadratic form Q can also be written in terms 
of matrices as 


bi biz + bin hı 
by, bn +++ bay, hy 

Q(h)=[hi ho +s ha] 2, ~ |=h'Bh, © 
bni bn2 oe brn hy, 


where B = (b;;). Note that the function Q is unchanged if we replace all b;; with 
5(b; j + bji). Hence, we may always assume that the matrix B associated to Q is 
symmetric, that is, that b;; = bj; (or, equivalently, that B” = B). Ignoring the 
factor of 1/2, we see that the right side of (1) is the quadratic form in h = x — a, 
corresponding to the matrix B = Hf (a). 

A quadratic form Q (respectively, its associated symmetric matrix B) is said 
to be positive definite if O(h) > 0 forall h Æ 0 and negative definite if O(h) < 0 
for all h Æ 0. Note that if Q is positive definite, then Q has a global minimum (of 
0) at h = 0. Similarly, if Q is negative definite, then Q has a global maximum at 
h=0. 

The importance of quadratic forms to us is that we can judge whether f has 
a local extremum at a critical point a by seeing if the quadratic form in the right 
side of (1) has a maximum or minimum at x = a. The precise result, whose proof 
is given in the addendum to this section, is the following: 


THEOREM 2.3 Let U C R” be open and f:U — R a function of class C?. 
Suppose that a € U is a critical point of f. 


1. If the Hessian Hf (a) is positive definite, then f has a local minimum at a. 
2. Ifthe Hessian H f(a) is negative definite, then f has a local maximum at a. 


3. If det H f(a) Æ 0 but H f(a) is neither positive nor negative definite, then f 
has a saddle point at a. 
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In view of Theorem 2.3, the issue thus becomes to determine when the Hessian 
Hf (a) is positive or negative definite. Fortunately, linear algebra provides an 
effective means for making such a determination, which we state without proof. 
Given a symmetric matrix B (which, as we have seen, corresponds to a quadratic 


form Q), let Bz, fork = 1, ..., n, denote the upper leftmost k x k submatrix of 
B. Calculate the following sequence of determinants: 
bi bn 
det B, =b f det B= 
1 ll 2 | bai Bes 
bi bn b 
det B3 = | bx} bun bz |,...,detB, = det B. 
b31 b32 b33 


If this sequence consists entirely of positive numbers, then B and Q are positive 
definite. If this sequence is such that det B < 0 for k odd and det B, > 0 for k 
even, then B and Q are negative definite. Finally, if det B Æ 0, but the sequence 
of determinants det B4, det B2, ..., det B, is neither of the first two types, then B 
and Q are neither positive nor negative definite. Combining these remarks with 
Theorem 2.3, we can establish the following test for local extrema: 


Second derivative test for local extrema. Given a critical point a of a func- 
tion f of class C?, look at the Hessian matrix evaluated at a: 


fax (a) Fix, (a) rae Pex (a) 
Hf (a) = L (a) ae - at (a) 
Fran (a) Teas (a) aca Feats (a) 


From the Hessian, calculate the sequence of principal minors of H f(a). 
This is the sequence of the determinants of the upper leftmost square sub- 
matrices of Hf(a). More explicitly, this is the sequence d1, d2,..., dn, 
where dy = det Hy, and H; is the upper leftmost k x k submatrix of H f(a). 
That is, 


dı = Iaa 

Jasa (a) foa) 

Ee fO 

Jsa (a) Poa) eee) 

dz =} fox (a) fron (a) fona) |z- dn = |H fa). 
foala) fonla) forla) 


The numerical test is as follows: 
Assume that d,, = det Hf (a) 4 0. 


dy = 


1. Ifd, > 0 fork =1,2,...,n, then f has a local minimum at a. 


2. Ifd, < 0 for k odd and d; > 0 for k even, then f has a local maximum 
at a. 


3. If neither case 1 nor case 2 holds, then f has a saddle point at a. 
In the event that det Hf(a) = 0, we say that the critical point a is degenerate 


and must use another method to determine whether or not it is the site of an 
extremum of f. 


4.2 | Extrema of Functions 269 


EXAMPLE 4 Consider the function 
fa, yy=x?+xyty?+2x-—2y+5 


in Example 3. We have already seen that (—2, 2) is the only critical point. The 


Hessian is 
— Jex Tey = 2 1 
He | > | : | E | 


The sequence of principal minors is dı = fyx(—2,2)=2 (> 0), d = 
|Hf (—2, 2)| = 3 (> 0). Hence, f has a minimum at (—2, 2), as we saw before, 
but this method uses less algebra. + 


EXAMPLE 5 (Second derivative test for functions of two variables) Let us 
generalize Example 4. Suppose that f(x, y) is a function of two variables of class 
C? and further suppose that f has a critical point at a = (a, b). The Hessian 
matrix of f evaluated at (a, b) is 


Hf (a, b) = | fxx(a, b) fry(a, b) | 


f(a, b) fyy(a, b) 


Note that we have used the fact that fyy = fyx (since f is of class C?) in con- 
structing the Hessian. The sequence of principal minors thus consists of two 
numbers: 


dı = fxx(a,b) and d= fyx(a, b) fyy(a, b) — fela, bY. 

Hence, in this case, the second derivative test tells us that 
1. f has a local minimum at (a, b) if 

fsx(a,b)>0 and frx(a, b) fyy(a, b) — fryla, bY > 0. 
2. f has a local maximum at (a, b) if 

fex(a,b) <0 and frx(a, b)fyyla, b)— fela, bY > 0. 
3. f has a saddle point at (a, b) if 

fex(a, b) fyy(a, b) — fry(a, bY < 0. 


Note that if fxx(a, b) fyy(a, b) — fxy(a, b} = 0, then f has a degenerate critical 
point at (a, b) and we cannot immediately determine if (a, b) is the site of a local 
extremum of f. + 


EXAMPLE 6 Let f(x, y, z) = x? + xy? + x? + y? + 3z. To find any local 
extrema of f, we must first identify the critical points. Thus, we solve 


Df (x, y, z) = [3x7 + y? + 2x 2xy + 2y 6z]=[0 0 0]. 


From this, it is not hard to see that there are two critical points: (0, 0, 0) and 
(—4, 0, 0). The Hessian of f is 


6x + 2 2y 0 
Af (x, y,z)= 2y 2x +2 0 
0 0 6 
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Figure 4.19 Away from the 
origin, the function h of Example 7 
is negative along the x-axis and 
positive along the y-axis. 


At the critical point (0, 0, 0), we have 


2 0 0 
Hf(0,0,0)=| 0 2 0l, 
0 0 6 


and its sequence of principal minors is dı = 2, d = 4, d3 = 24. Since these 
determinants are all positive, we conclude that f has a local minimum at (0,0,0). 
At (—3, 0, 0), we calculate that 

1 


2 
Hf (-5.0.0) = 0 
6 


The sequence of minors is —2, — 4, —8. Hence, f has a saddle point at (— x, 0, 0). 
+ 


0 


l 
© N 
O un O 


EXAMPLE 7 To get a feeling for what happens in the case of a degenerate 
critical point (i.e., a critical point a such that det H f (a) = 0), consider the three 
functions 


fa, yaxt ter ty 
gax, y) = xt- x? =F, 
and 
h(x, y) = xf =- x? + y". 


We leave it to you to check that the origin (0, 0) is a degenerate critical point 
of each of these functions. (In fact, the Hessians themselves look very similar.) 
Since f is 0 at (0, 0) and strictly positive at all (x, y) Æ (0, 0), we see that f 
has a strict minimum at the origin. Similar reasoning shows that g has a strict 
maximum at the origin. For h, the situation is slightly more complicated. Along 
the y-axis, we have h(0, y) = y*, which is zero at y = 0 (the origin) and strictly 
positive everywhere else. Along the x-axis, 


h(x, 0) = xt — x? = x? (x — D(x + 1). 


For —1 < x < 1 and x 40, h(x, 0) < 0. We have the situation depicted in Fig- 
ure 4.19. Thus, every neighborhood of (0, 0) contains some points (x, y) where h 
is positive and also some points where / is negative. Therefore, h has a saddle point 
at the origin. The “moral of the story” is that a degenerate critical point can exhibit 
any type of behavior, and more detailed consideration of the function itself, rather 
than its Hessian, is necessary to understand its nature as a site ofan extremum. ® 


Global Extrema on Compact Regions 


Thus far our discussion has been limited to consideration of only local extrema. 
We have said nothing about how to identify global extrema, because there really is 
no general, effective method for looking at an arbitrary function and determining 
whether and where it reaches an absolute maximum or minimum value. For the 
purpose of applications, where finding an absolute maximum or minimum is 
essential, such a state of affairs is indeed unfortunate. Nonetheless, we can say 
something about global extrema for functions defined on a certain type of domain. 
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R? R? 


Figure 4.20 Compact regions. 


DEFINITION 2.4 A subset X C R” is said to be compact if it is both closed 
and bounded. 


Recall that X is closed if it contains all the points that make up its boundary. 
(See Definition 2.3 of §2.2.) To say that X is bounded means that there is some 
(open or closed) ball B that contains it. (That is, X is bounded if there is 
some positive number M such that ||x|| < M for all x € X.) Thus, compact sets 
contain their boundaries (a consequence of being closed) and have only finite 
extent (a consequence of being bounded). Some typical compact sets in R? and 
R? are shown in Figure 4.20. 

For our purposes the notion of compactness is of value because of the next 
result, which we state without proof. 


THEOREM 2.5 (EXTREME VALUE THEOREM) If X C R” is compact and f: 
X — R is continuous, then f must have both a global maximum and a global 
minimum somewhere on X. That is, there must exist points amax and amin in X 
such that, for all x € X, 


f (amin) $ f(x) < f (Amax): 


We need the compactness hypothesis since a function defined over a noncom- 
pact domain may increase or decrease without bound and, hence, fail to have any 
global extremum, as suggested by Figure 4.21. This is analogous to the situation 
in one variable where a continuous function defined on an open interval may fail 
to have any extrema, but one defined on a closed interval (which is a compact 
subset of R) must attain both maximum and minimum values. (See Figure 4.22.) 
In the one-variable case, extrema can occur either in the interior of the interval 
or else at the endpoints. Therefore, you must compare the values of f at any 
interior critical points with those at the endpoints to determine which is largest 
and smallest. In the case of functions of n variables, we do something similar, 
namely, compare the values of f at any critical points with values at any restricted 
critical points that may occur along the boundary of the domain. 
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No global 
minimum 


x 


Figure 4.21 A graph that lacks a 
global minimum. 


y=-l 


Figure 4.23 The domain of the 
function T of Example 8. 


No maximum or minimum attained 


y 
Global 


maximum 


x Global 


minimum 
i 
T 


a b 


>+ 


= 


Figure 4.22 The function depicted by the graph on the left has no global extrema—the 
function is defined on the open interval (a, b). By contrast, the function defined on the 
closed interval [a, b], and with the graph on the right, has both a global maximum and 
minimum. 


EXAMPLE 8 Let 7: X c R? > R be given by 
T(x, y) =x? -xy +y +1, 


where X is the closed square in Figure 4.23. (Note that X is compact.) Think of 
the square as representing a flat metal plate and the function T as the temperature 
of the plate at each point. Finding the global extrema amounts to finding the 
warmest and coldest points on the plate. According to Theorem 2.5, such points 
must exist. 

We need to find all possible critical points of T. Momentarily considering T 
as a function on all of R?, we find the usual critical points by setting DT (x, y) 
equal to 0. The result is the system of two equations 


2x -—y=0 
—x+2y=0’ 
which has (0, 0) as its only solution. Whether it is a local maximum or minimum 
is not important for now, because we seek global extrema. Because there is only 
one critical point, at least one global extremum must occur along the boundary of 


X (which consists of the four edges of the square). We now find all critical points 
of the restriction of T to this boundary: 


1. The bottom edge of X is the set 
E = {@,y)|y=—1,-] <x < 2}. 
The restriction of T to E; defines a new function f;:[—1, 2] —> R given by 
fits) = T(x, -1) =x? +x42. 


As fi(x) = 2x + 1, the function fı has a critical point at x = —3. Thus, 
we must examine the following points of X for possible extrema: (- L, — 1), 
(—1, —1), and (2, — 1). (The first point is the critical point of fı, and the second 
two are the vertices of X that lie on £1.) 


2. The top edge of X is given by 
E: = {(x, y)| y =2, -1 < x <2}. 
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Consequently, we define f2:[—1, 2] —> R by 
fo(x) = T(x, 2) = x? — 2x +5. 


(fo is the restriction of T to E2.) We calculate f;(x) = 2x — 2, which implies 
that x = 1 is a critical point of f2. Hence, we must consider (1, 2), (—1, 2), 
and (2, 2) as possible sites for global extrema of T. (The points (—1, 2) and 
(2, 2) are the remaining two vertices of X.) 


3. The left edge of X is 
E; = {(x, y)|x=—1,-l<y < 2}. 
Therefore, we define f3 : [—1, 2] —> R by 
BOSTE, y)=y ty +2. 
We have f3(y) = 2y + 1, and so y = —} is the only critical point of f3. Thus 


(- 1,- ) is a potential site of a global extremum. (We need not worry again 


about the vertices (—1, —1) and (—1, 2).) 
. The right edge of X is 


E,={(x,y)|x=2,-l<y <2}. 
We define f4:[—1, 2] —> R by 
fay) = TQ, y) = y? — 2y +5. 


We have f(y) = 2y — 2, and so y = 1 is the only critical point of f4. Hence, 
we must include (2, 1) in our consideration. 


A 


Consequently, we have nine possible locations for global extrema, shown in 
Figure 4.24. Now we need only to compare the actual values of T at these points 
to see that (0, 0) is the coldest point on the plate and both (2, —1) and (—1, 2) are 
the hottest points. + 


If a function is defined over a noncompact region, there is no general result 
like the extreme value theorem (Theorem 2.5) to guarantee existence ofany global 
extrema. However, ad hoc arguments frequently can be used to identify global 
extrema. 


y 
(-1,2) (1,2) (2,2) 
1 
(2,1) i 
(0,0) r 
: ba X 8 
(-1, -3) 4 
(-1,-1) / (2,-1) 8 
(-3,-1) 5 
7 
4 


Figure 4.24 Possible global extrema for T. 
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EXAMPLE 9 Consider the function f(x, y) = e!~3*’-”” defined on all of R? 
(so the domain is certainly not compact). Verifying that f has a unique critical 
point at (0, 0) is straightforward. We leave it to you to check that the Hessian 
criterion implies that f has a local maximum there. In any case, forall (x, y) € R’, 
we have 

133° -y <1. 
Therefore, because the exponential function is always increasing (i.e., if uy < u2, 
then e”! < e"), 

el 3 -y? < e! =e. 


As f(0, 0) = e, we see that f has a global maximum at (0, 0). + 
WARNING Itis tempting to assume that if a function has a unique critical point 
that is a local extremum, then it must be a global extremum as well. Although 


true for the case of a function of a single variable, it is not true for functions of 
two or more variables. (See Exercise 52 for an example.) 


Addendum: Proof of Theorem 2.3 


Step 1. We show the following key property of a quadratic form Q, namely, that 
if à € R, then 


Q(Ah) = A? 0h). (3) 


This is straightforward to establish if we write Q in terms of its associated sym- 
metric matrix B and use some of the properties of matrix arithmetic given in §1.6: 


Q(Ah) = (Ah)? B(Ah) = Ah? B(Ah) = 47h’ Bh = 27 O(h). 


Step 2. We show that if B is the symmetric matrix associated to a positive 
definite quadratic form Q, then there is a positive constant M such that 


O(h) > M|jh||? 


for all h € R”. 

First, note that when h = 0, then Q(h) = Q(0) = 0 so the conclusion holds 
trivially in this case. 

Next, suppose that h is a unit vector (1.e., ||h|| = 1). The (endpoints of the) set 
of all unit vectors in R” is an (n — 1)-dimensional sphere S, which is a compact 
set. Hence, by the extreme value theorem (Theorem 2.5), the restriction of Q to 
S must achieve a global minimum value M somewhere on S. Thus, Q(h) > M 
forallhe S. 

Finally, let h be any nonzero vector in R”. Then its normalization h/||h|] is a 
unit vector and so lies in S. Therefore, by the result of Step 1, we have 


h h 
h) = h|| — ) = |[h||? | — h|? M, 
Q(h) (1 Il aa) || h|| o(a) > ||hl| 
since h/||h|| is in S. 


Step 3. Now we prove the theorem. By the second-order Taylor formula 
Theorem 1.5 and formula (10) of §4.1, we have that, for the critical point a of f, 


Af = fŒ) - f(a) = 30- a) Hf (a) — a) + Ro(x, a), (4) 


where | R2(x, a)|/||x — all? > 0 as x > a. 


4.2 | Extrema of Functions 275 


Suppose first that H f(a) is positive definite. Then by Step 2 with h = x — a, 
there must exist a constant M > 0 such that 


3(x — a)’ Hf(a)(x— a) > MIx — all’. (5) 


Because | R2(x, a)|/||x — all? — 0 as x — a, there must be some ô > 0 so that if 
0 < ||x — all < ô, then |R2(x, a)|/||x — all? < M, or, equivalently, 


|Ro(x, a)| < M||x — all’. (6) 
Therefore, (4), (5), and (6) imply that, for 0 < ||x — all < ô, 
Af >0 


so that f has a (strict) local minimum at a. 


If H f(a) is negative definite, then consider g = — f. We see that a is also a 
critical point of g and that H g(a) = —H f (a), so H g(a) is positive definite. Hence, 
the argument in the preceding paragraph shows that g has a local minimum at a, 
so f has a local maximum at a. 

Now suppose det H f(a) 4 0, but that H f(a) is neither positive nor negative 
definite. Let x; be such that 

5(x, — a)’ Hf(a)(x; — a) > 0 
and x» such that 

5(X2 — a)’ Hf (a)(x2 — a) < 0. 
(Since det H f(a) Æ 0, such points must exist.) For i = 1, 2 let 

yi(t) = t; — a) + a, 
the vector parametric equation for the line through a and x;. Applying formula 
(4) with x = y;,(t), we see 
Af = flyi(t)) — f(a) = 3(yi(t) — a)" Hf (ay(yi(t) — a) + Ro(yi(0), a) 


2 Ro(yi(t), a) 

llyi(t) — all? 
Note that y;(t) — a = t(x; — a). Therefore, using the property of quadratic forms 
given in Step 1 and the fact that ||y;(t) — all? = ||¢(x; — a)||? = £x; — all?, we 
have 


II 


x(yi(t) — a)” Hf (ay(yi(t) — a) + lly:®© — all 


II 


fO) — fa) 
(7) 


=? [So -a Afar a) + ix; — al? 2 | 


llyi(¢) — all? 
Now note that, for i = 1, the first term in the brackets in the right side of (7) is a 
positive number P and, for i = 2, it is a negative number N. Set 


P N 
M = min ( zi :). 
x: — all [x2 — all 


Because we know that | R2(y;(t), a)|/lly;(t) — all? 0 as t > 0, we can find 
some ô > 0 so that if 0 < t < ô, then 


| Ro(yi(t), a)l 
lly:() — all? 
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But this implies that, for 0 < t < ô, 


while 


Af = f0) — f(a) > 9, 


Af = flya(t)) — fa) < 0. 


Thus, f has a saddle point at x = a. a 


4.2 Exercises 


1. 


Concerning the function f(x, y) =4x + 6y — 12 — 

vy? 

(a) There is a unique critical point. Find it. 

(b) By considering the increment Af, determine 
whether this critical point is a maximum, a mini- 
mum, or a saddle point. 

(c) Now use the Hessian criterion to determine the 
nature of the critical point. 


. This problem concerns the function g(x, y) = x? — 


Qy? +2x +3. 
(a) Find any critical points of g. 


(b) Use the increment Ag to determine the nature of 
the critical points of g. 


(c) Use the Hessian criterion to determine the nature 
of the critical points. 


In Exercises 3—20, identify and determine the nature of the 
critical points of the given functions. 


3. 


f(x, y) = 2xy — 2x? — 5y? +4y —3 


4. f(x, y)= ln(x? + y? +1) 
5. 
6 


f(x, y) =x? + y? — 6xy + 3x + 6y 


. JŒ, yay? —2xy +x? -x 


8&8 1 
f(x, y)=xy+-+- 
x y 


8. f(x, y) =e" siny 


f(x, y) =e- y’) 


» fŒ, y) = @+y)0 — xy) 

-f@ y)=x -y xy +y 

~ f(x, y) = e™ + 3y?) 

« f(x, y) = 2x —3y + Inxy 

. f(x, y) = cosx sin y 

« [@. 72S x7 = žy +z = Dez + 6z 
~ f(y, z) = (x? + 2y? + 1) cosz 

. fŒ, y, z) =x? + y? +277 4 xz 


18. f(x, y, Z) = x? + xz? — 3x? + y? +22? 
1 
19. fœ, y z) =xy +xz+2yz +7 


20. f(x, y, z) = (x? = y? = 22’) 
21. (a) Find all critical points of f(x,y)= 
2y? — 3y? — 36y + 2 
1 + 3x? 
(b) Identify any and all extrema of f. 


22. (a) Under what conditions on the constant k will the 
function 


f(x,y) = kx? — Ixy + ky’ 


have a nondegenerate local minimum at (0, 0)? 
What about a local maximum? 


(b) Under what conditions on the constant k will the 
function 


k 
g(x, y, z) = kx? + kxz — 2yz — y’ + i 
have a nondegenerate local maximum at (0, 0, 0)? 
What about a nondegenerate local minimum? 


23. (a) Consider the function f(x, y)= ax? + by?, 
where a and b are nonzero constants. Show that 
the origin is the only critical point of f , and deter- 
mine the nature of that critical point in terms of a 
and b. 


(b) Now consider the function f(x, y, z) = ax? + 
by? + cz”, where a, b, and c are all nonzero. Show 
that the origin in R? is the only critical point of f, 
and determine the nature of that critical point in 
terms ofa, b, and c. 


(c) Finally, let f(x1,%2,..-,%) = ax? + ax? 


+--+ + anx2, where a; is a nonzero constant for 
i = 1,2,..., n. Show that the origin in R” is the 
only critical point of f, and determine its nature. 


Sometimes it can be difficult to determine the critical point of 
a function f because the system of equations that arises from 
setting V f equal to zero may be very complicated to solve by 
hand. For the functions given in Exercises 24—27, (a) use a 
computer to assist you in identifying all the critical points of 
the given function f, and (b) use a computer to construct the 


Hessian matrix and determine the nature of the critical points 
found in part (a). 


Q 24. 


@ 25 


@ 26. 
Q 27. 


28 


29. 


30. 


31 


32. 


33. 


34. 


35. 


36. 


37. 


f(x, y) = yt +x? —2xy -x 


f(x, y) = 2x°y — y? — 3xy 


fŒ, y, z) = yz — xyz — x? — y? — 22? 


f(x,y,z, w) = yw — xyz x? — 277 + w? 


Show that the largest rectangular box having a fixed 
surface area must be a cube. 


What point on the plane 3x — 4y — z = 24 is closest 
to the origin? 


Find the points on the surface xy + z? = 4 that are 
closest to the origin. Be sure to give a convincing ar- 
gument that your answer is correct. 


Suppose that you are in charge of manufacturing two 
types of television sets. The revenue function, in dol- 
lars, is given by 


R(x, y) = 8x + 6y — x? — 2y? + 2xy, 


where x denotes the quantity of model X sets sold, and 
y the quantity of model Y sets sold, both in units of 
100. Determine the quantity of each type of set that 
you should produce in order to maximize the resulting 
revenue. 


Find the absolute extrema of f(x, y) = x? + xy + 
y? — 6y on the rectangle {(x, y)| -3 <x <3, 0< 
y <= 5}. 


Find the absolute maximum and minimum of 
f(x,y, z) =x +xz— y +22 +xy + 5x 


on the block {(x, y,z)| -S5<x<0, 0<y <3, 
0<z <2}. 


A metal plate has the shape of the region x? + y? < 1. 
The plate is heated so that the temperature at any point 
(x, y) on it is indicated by 


T(x, y) = 2x7 + y?—y43. 


Find the hottest and coldest points on the plate and the 
temperature at each of these points. (Hint: Parametrize 
the boundary of the plate in order to find any critical 
points there.) 


Find the (absolute) maximum and minimum values of 
f(x, y) = sinx cos y on the square R = {(x, y)|0< 
x <2n, 0<y<2z}. 


Find the absolute extrema of f(x, y)=2cosx + 
3siny on the rectangle {(x,y)|0<x<4, 0< 
y <3}. 


Determine the absolute minimum and maximum 
values of the function f(x, y) = 2x? —2xy + y? 


38. 


39. 
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— y + 3 on the closed triangular region with vertices 
(0, 0), (2, 0), and (0, 2). 


Determine the absolute minimum and maximum val- 
ues of the function f(x, y) = x”y on the elliptical re- 
gion D = {(x, y) | 3x? + 4y? < 12}. 

absolute 


Find the extrema of f(x,y,z)= 
e!l == +2y-2 -4z on the ball {(x, y, z) | x? + y? — 2y 
+z +4 < 0}. 


Each of the functions in Exercises 40-45 has a critical point 
at the origin. For each function, (a) check that the Hessian 
fails to provide any information about the nature of the critical 
point at the origin, and (b) find another way to determine if the 
function has a maximum, minimum, or neither at the origin. 


40. 
41. 
42. 
43. 
44. 
45. 


fæ, y) =x 
f(x, y) = 4 — 3x y 
fa y) =y 


fŒ, y, z) = x? y?zt 
f&, y, z) = xyz! 
f(x, y,z)=2-xtyt -z 


In Exercises 46—48, (a) find all critical points of the given 
function f and identify their nature as local extrema and (b) 
determine, with explanation, any global extrema of f. 


46. 
47. 
48. 
49. 


50. 


51. 


52. 


faye Pr 

fæ, yz) = e2- -27-37 

fx, y) =x +y —3xy+7 

Determine the global extrema, if any, of 
f(x, y)=xy+2y —Inx —2Iny, 

where x, y > 0. 


Find all local and global extrema of the function 
f(x, y, Z) = x3 +3x2 + et! 4 2? — 3xz. 

Let fœ, y) = 3 - [œ - DO - 2P⁄. 

(a) Determine all critical points of f. 

(b) Identify all extrema of f. 


(a) Suppose f:R — R is a differentiable function of 
a single variable. Show that if f has a unique crit- 
ical point at xo that is the site of a strict local ex- 
tremum of f, then f must attain a global extremum 
at Xo. 

Let f(x, y) = 3ye* — e? — y?. Verify that f has 
a unique critical point and that f attains a local 
maximum there. However, show that f does not 
have a global maximum by considering how f be- 
haves along the y-axis. Hence, the result of part 
(a) does not carry over to functions of more than 
one variable. 


(b) 


278 Chapter 4 | Maxima and Minima in Several Variables 


53. (a) Let f be a continuous function of one variable. as we now see. Consider the function 
Show that if f has two local maxima, then f must f(x,y) =2—-(xy? —y —1% —(? - 1). 


also have a local minimum. 


Show that f has just two critical points—and that 


(b) The analogue of part (a) does not necessarily hold both of them are local maxima. 
for continuous functions of more than one variable, > (c) Use a computer to graph the function f in part (b). 


y 
| A 
kK x 4 


Figure 4.25 The open box of 
Example 1. 


4.3 Lagrange Multipliers 


Constrained Extrema 


Frequently, when working with applications of calculus, you will find that you 
do not need simply to maximize or minimize a function but that you must do so 
subject to one or more additional constraints that depend on the specifics of the 
situation. The following example is a typical situation: 


EXAMPLE 1 An open rectangular box is to be manufactured having a (fixed) 
volume of 4 ft?. What dimensions should the box have so as to minimize the 
amount of material used to make it? 

We’ll let the three dimensions of the box be independent variables x, y, and 
z, Shown in Figure 4.25. To determine how to use as little material as possible, 
we need to minimize the surface area function A given by 


A(x, y,zZ)= xy +2yz+ xz 


front and back sides bottom only 


For x, y, z > 0, this function has neither minimum nor maximum. However, we 
have not yet made use of the fact that the volume is to be maintained at a constant 
4 ft. This fact provides a constraint equation, 


V(x, y,zZ) = xyz = 4. 


The constraint is absolutely essential if we are to solve the problem. In particular, 
the constraint enables us to solve for z in terms of x and y: 


4 
xy 


We can thus create a new area function of only two variables: 


4 
a(x, y) =A (= y, 2) 


4 4 
= 2xy + 2y Pe Hx T 


=2xy+-—+-. 


Z= 


Now we can find the critical points of a by setting Da equal to 0: 


ða 8 
u 0 
ax y 2 
0 4 i 
ee ae 
dy y? 
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The first equation implies 


so that the second equation becomes 


x4 
2x —4{[—])=0 
i o 

1 
x(1- 53) =0. 


The solutions to this equation are x = 0 (which we reject) and x = 2. Thus, the 
critical point of a of interest is (2, 1), and the constrained critical point of the 
original function A is (2, 1, 2). 

We can use the Hessian criterion to check that x = 2, y = 1 yields a local 
minimum of a: 


16/x? 2 2 2 
Hats. = | | so Ha(2, D=f |: 


or, equivalently, 


2  8/y? 2 8 


The sequence of minors is 2, 12 so we conclude that (2, 1) does yield a local 
minimum of a. Because a(x, y) > oo as either x > 0*, y > 0", x > ov, or 
y — œ, we conclude that the critical point must yield a global minimum as well. 
Thus, the solution to the original question is to make the box with a square base 
of side 2 ft and a height of 1 ft. + 


The abstract setting for the situation discussed in Example 1 is to find max- 
ima or minima of a function f(x1, x2, ..., Xn) subject to the constraint that 
g(x1, X2, -.., Xn) = c for some function g and constant c. (In Example 1, the 
function f is A(x, y, z), and the constraint is xyz = 4.) One method for finding 
constrained critical points is used implicitly in Example 1: Use the constraint 
equation g(x) = c to solve for one of the variables in terms of the others. Then 
substitute for this variable in the expression for f(x), thereby creating a new 
function of one fewer variables. This new function can then be maximized or 
minimized using the techniques of §4.2. In theory, this is an entirely appropriate 
way to approach such problems, but in practice there is one major drawback: It 
may be impossible to solve explicitly for any one of the variables in terms of the 
others. For example, you might wish to maximize 


f(x,y, 2) =x + 3y + y*z4 


subject to 


XxX 
g(x, y, z) = e? — x y’z4 cos ( ) =2. 
yz 


There is no means of isolating any of x, y, or z on one side of the constraint 
equation, and so it is impossible for us to proceed any further along the lines of 
Example 1. 
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The Lagrange Multiplier 


The previous discussion points to the desirability of having another method for 
solving constrained optimization problems. The key to such an alternative method 
is the following theorem: 


THEOREM 3.1 Let X be open in R” and f, g: X — R be functions of class C!. 
Let S = {x € X | g(x) = c} denote the level set of g at height c. Then if f |s (the 
restriction of f to S) has an extremum at a point xo € S such that Vg(xo) Æ 0, 
there must be some scalar 4 such that 


V f (Xo) = AV g0). 


The conclusion of Theorem 3.1 implies that to find possible sites for extrema 
of f subject to the constraint that g(x) = c, we can proceed in the following 
manner: 

1. Form the vector equation V f(x) = AV g(x). 
2. Solve the system 


te = AVg(x) 


g(x) =c 
for x and A. When expanded, this is actually a system of n + 1 equations in 
n + 1 unknowns x1, X%2,...,X,, A, namely, 
falis X2, one Sa) = À8x (x1, X2, E Xn) 


So (x1, X2, KEN sln) = À8x (X1, X2, Cea | Xn) 


te, 1, X2,-+-. Xp) = Agx,(X1; X2,---, Xn) 
8(X1, X2, t e D) =c 


The solutions for x = (x1, X2, . . . , Xn) in the system above, along with any 
other points x satisfying the constraint g(x) = c and such that V f is unde- 
fined, or Vg vanishes or is undefined, are the candidates for extrema for the 
problem. 


3. Determine the nature of f (as maximum, minimum, or neither) at the critical 
points found in Step 2. 


The scalar à appearing in Theorem 3.1 is called a Lagrange multiplier, after 
the Italian-born French mathematician Joseph-Louis Lagrange (1736—1813) who 
first developed this method for solving constrained optimization problems. In 
practice, Step 2 can involve some algebra, so it is important to keep your work 
organized. (Alternatively, you can use a computer to solve the system.) In fact, 
since the Lagrange multiplier à is usually not of primary interest, you can avoid 
solving for it explicitly, thereby reducing the algebra and arithmetic somewhat. 
Determining the nature of a constrained critical point (Step 3) can be a tricky 
business. We’ll have more to say about that issue in the examples and discussions 
that follow. 


EXAMPLE 2 Let us use the method of Lagrange multipliers to identify the 
critical point found in Example 1. Thus, we wish to find the minimum of 


A(x, y, zZ) = 2xy + 2yz + xz 
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subject to the constraint 
V(x, y,Z) = xyz = 4. 
Theorem 3.1 suggests that we form the equation 
VA(x, y, Z) =AVV(x, y, Z). 


This relation of gradients coupled with the constraint equation gives rise to the 
system 


2y+ Z=Aayz 
2x + 2z=Axz 


2y+ x =Axy’ 
xyz=4 


Since A is not essential for our final solution, we can eliminate it by means of any 
of the first three equations. Hence, 


_2y+z 2x+2z  2y+x 
yz KZ 


Xr 


Simplifying, this implies that 


The first equality yields 


while the second equality implies that 


-=- or z=2y. 
z y 


Substituting these relations into the constraint equation xyz = 4 yields 


Qy)(y)(2y) = 4, 


so that we find that the only solution is y = 1, x = z = 2, which agrees with our 
work in Example 1. (Note that VV = 0 only along the coordinate axes, and such 
points do not satisfy the constraint V(x, y, z) = 4.) + 


An interesting consequence of Theorem 3.1 is this: By Theorem 6.4 of Chap- 
ter 2, we know that the gradient V g, when nonzero, is perpendicular to the level 
sets of g. Thus, the equation V f = AV g gives the condition for the normal vector 
to a level set of f to be parallel to that of a level set of g. Hence, for a point xo 
to be the site of an extremum of f on the level set S = {x | g(x) = c}, where 
Vg(xo) 4 0, we must have that the level set R of f that contains xq is tangent to 
S at Xo. 


EXAMPLE 3 Consider the problem of finding the extrema of f(x, y) = x7/4 
+ y? subject to the condition that x? + y? = 1. We let g(x, y) = x? + y’, and 
so the Lagrange multiplier equation V f(x, y) =AVg(x, y), along with the 
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Xe 


N 
ZZ 


Figure 4.26 The level sets of the 
function f(x, y) = x?/4 + y? 
define a family of ellipses. The 
extrema of f subject to the 
constraint that x? + y? = 1 (i.e., 
that lie on the unit circle) occur at 
points where an ellipse of the 
family is tangent to the unit circle. 


A 


constraint equation, yields the system 


X 


= = 2)x 
2 

2y=2dy - 
xX + y =l 


(There are no points simultaneously satisfying g(x, y) = 1 and Vg(x, y) = 
(0, 0).) The first equation of this system implies that either x = 0 or A = L. 
If x = 0, then the second two equations, taken together, imply that y = +1 and 
A=1.1fA= 1, then the second two equations imply y = 0 and x = +1. There- 
fore, there are four constrained critical points: (0, +1), corresponding to A = 1, 
and (+1, 0), corresponding to à = i. 

We can understand the nature of these critical points by using geometry and 
the preceding remarks. The collection of level sets of the function f is the family 
of ellipses x?/4 + y? = k whose major and minor axes lie along the x- and y- 
axes, respectively. In fact, the value f(x, y) = x?/4+ y? =k is the square of 
the length of the semiminor axis of the ellipse x7/4 + y? = k. The optimization 
problem then is to find those points on the unit circle x? + y? = 1 that, when 
considered as points in the family of ellipses, minimize and maximize the length 
of the minor axis. When we view the problem in this way, we see that such points 
must occur where the circle is tangent to one of the ellipses in the family. A sketch 
shows that constrained minima of f occur at (+1, 0) and constrained maxima at 
(0, +1). In this case, the Lagrange multiplier A represents the square of the length 
of the semiminor axis. (See Figure 4.26.) + 


EXAMPLE 4 Consider the problem of determining the extrema of f(x, y) = 
2x + y subject to the constraint that ./x + ./y = 3. We let g(x, y) = /x + JY, 
so that the Lagrange multiplier equation V f(x, y) = AV g(x, y), along with the 
constraint equation, yields the system 


a. 
(Vk 
i= 
EF 

Vx +./¥ =3 


The first two equations of this system imply that A = 4,/x = 2,/y so that ./y = 
2,/x. Using this in the last equation, we find that 3,/x = 3 and, hence, x = 1. 
Thus, the system of equations above yields the unique solution (1, 4). 

Since the constraint defines a closed, bounded curve segment, the extreme 
value theorem (Theorem 2.5) applies to guarantee that f must attain both a 
global maximum and a global minimum on this segment. However, the Lagrange 
multiplier method has provided us with just a single critical point. But note that the 
points (9, 0) and (0, 9) satisfy the constraint Vx + „/y = 3; they are both points 
where Vg is undefined. Moreover, we have f(1, 4) = 2, while f(9, 0) = 18 and 
f(0, 9) = 9. Evidently then, the minimum of f occurs at (1, 4) and the maximum 
at (9, 0). 

We can understand the geometry of the situation in the following manner. The 
collection of level sets of the function f is the family of parallel lines 2x + y = k. 
Note that the height k of each level set is just the y-intercept of the corresponding 
line in the family. Thus, the problem we are considering is to find the largest and 


Vg (Xo) 


Figure 4.28 The gradient 

V g(Xo) is perpendicular to 

S = {x | g(x) = c}, hence, to the 
tangent vector at xp to any curve 
x(t) lying in S and passing through 
xq. If f has an extremum at Xo, 
then the restriction of f to the 
curve also has an extremum at Xo. 


4.3 | Lagrange Multipliers 283 


Figure 4.27 The level sets of the function f(x, y) = 2x + y define a family 
of lines. The minimum of f subject to the constraint that ./x + ./y = 3 
occurs at a point where one of the lines is tangent to the constraint curve and 
the maximum at one of the endpoints of the curve. 


smallest y-intercepts of any line in the family that meets the curve yx + ./y = 3. 
These extreme values of k occur either when one of the lines is tangent to the 
constraint curve or at an endpoint of the curve. (See Figure 4.27.) 

This example illustrates the importance of locating all the points where ex- 
trema may occur by considering places where V f or Vg is undefined (or where 
Vg = 0) as well as the solutions to the system of equations determined using 
Lagrange multipliers. + 


Sketch of a proof of Theorem 3.1 We present the key ideas of the proof, which 
are geometric in nature. Try to visualize the situation for the case n = 3, where 
the constraint equation g(x, y, z) = c defines a surface S in R*. (See Figure 4.28.) 
In general, if S is defined as {x | g(x) = c} with V (xo) # 0, then (at least locally 
near xo) S is a hypersurface in R”. The proof that this is the case involves the 
implicit function theorem (Theorem 6.5 in §2.6), and this is why our proof here 
is just a sketch. 

Thus, suppose that xg is an extremum of f restricted to S. We consider a 
further restriction of f—to a curve lying in S and passing through x9. This will 
enable us to use results from one-variable calculus. The notation and analytic 
particulars are as follows: Let x: Z C R > S c R? be a C! path lying in S with 
X(to) = Xo for some tọ € J. Then the restriction of f to x is given by the function 
F, where 


F(t) = f(x@). 


Because xo is an extremum of f on S, it must also be an extremum on x. Conse- 
quently, we must have F’(to) = 0, and the chain rule implies that 


d 
0 = F'() = at Dliz = VF (x(t) x(t) = Vf (Xo) +x'(o). 


Thus, V f(Xo) is perpendicular to any curve in S passing through xo; that is, 
V f (Xo) is normal to S at xo. We’ve seen previously in §2.6 that the gradient 
Vg(Xo) is also normal to S at xo. Since the normal direction to the level set S is 
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uniquely determined and V g(xo) 4 0, we must conclude that V f (xo) and V g(xo) 
are parallel vectors. Therefore, 


V f (Xo) = AV g0) 


for some scalar à € R, as desired. a 


The Case of More than One Constraint 


It is natural to generalize the situation of finding extrema of a function f subject 
to a single constraint equation to that of finding extrema subject to several con- 
straints. In other words, we may wish to maximize or minimize f subject to k 
simultaneous conditions of the form 


gi(x) =c 
82(X) = C2 
8k(X) = Ce 


The result that generalizes Theorem 3.1 is as follows: 


THEOREM 3.2 Let X be open in R” and let f, g1, ..., g: X C R” > Rbe C! 
functions, where k < n. Let S = {x € X | 81) = c1, ... , o¢(x) = cx}. If f |s has 
an extremum at a point Xo, where V g1 (X0), ..., Vgx(Xo) are linearly independent 
vectors, then there must exist scalars 1, ..., A, such that 


V f (Xo) = A1 V81 (X0) + 2V 82X0) +--+ + Ak V 8X0). 


(Note: k vectors v;,..., Vg in R” are said to be linearly independent if the only 
way to satisfy a1Vı + ---+ agv, = 0 for scalars a1, ... , a, is ifai = a =- - = 
ak = 0.) 


Idea of proof First, note that S is the intersection of the k hypersurfaces $1, ..., 
Sk, where S; = {x € R” | g;(x) = cj}. Therefore, any vector tangent to $ must 
also be tangent to each of these hypersurfaces, and so, by Theorem 6.4 of 
Chapter 2, perpendicular to each of the Vg;’s. Given these remarks, the main 
ideas of the proof of Theorem 3.1 can be readily adapted to provide a proof of 
Theorem 3.2. 

Therefore, we let x) € S be an extremum of f restricted to S and consider 
the one-variable function obtained by further restricting f to a curve in S through 
Xo. Thus, let x: 7 > S c R” bea C! curve in S with x(to) = Xo for some to € T. 
Then, as in the proof of Theorem 3.1, we define F by 


F(t) = fa). 
It follows, since x9 is assumed to be a constrained extremum, that 
F'(to) = 0. 
The chain rule then tells us that 
0 = F'(to) = V f(x(to)) + X'(to) = V f (Xo) + x'(to). 


That is, V f (xo) is perpendicular to all vectors tangent to S at xo. Therefore, it can 
be shown that V f (xo) is in the k-dimensional plane spanned by the normal vectors 


Plane spanned by 
Vgi(Xo) and 


Figure 4.29 Illustration of the 
proof of Theorem 3.2. The 
constraints gı (x) = cı and 

82(X) = c2 are the surfaces Sı and 
S2. Any extremum of f must occur 
at points where V f is in the plane 
spanned by Vg, and Vg. 
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to the individual hypersurfaces $1, ..., 5; whose intersection is S. It follows (via 
a little more linear algebra) that there must be scalars 41, ..., Ax such that 


V f (Xo) = 41V 81 (Xo) + A2V go(Xo) ++ ++ + ARV 8X0). 


A suggestion of the geometry of this proof is provided by Figure 4.29 (where 
k = 2 andn = 3). E 


EXAMPLE 5 Suppose the cone z? = x? + y? is sliced by the plane z = x + 
y +2 so that a conic section C is created. We use Lagrange multipliers to find 
the points on C that are nearest to and farthest from the origin in R°. 

The problem is to find the minimum and maximum distances from (0, 0, 0) 
of points (x, y, z) on C. For algebraic simplicity, we look at the square of the 
distance rather than the actual distance. Thus, we desire to find the extrema of 

f(x,y,z) =x +y +z 


(the square of the distance from the origin to (x, y, z)) subject to the constraints 


gia, y, z =x +y az= 
ga, y z =x+y-z=-2 | 


Note that 
Vei(x, y, z) = (2x, 2y, —2z) and Vgo(x, y,z)=(,1,-1). 


These vectors are linearly dependent only when x = y = z. However, no point of 
the form (x, x, x) simultaneously satisfies g; = 0 and g3 = —2. Hence, Vg; and 
Vez are linearly independent at all points that satisfy the two constraints. There- 
fore, by Theorem 3.2, we know that any constrained critical points (xo, yo, Zo) 
must satisfy 


V f (Xo, Yo, Zo) = A1 Vg8i(Xo, Yo, Zo) + à2V 82(x0, Yo, Zo), 


as well as the two constraint equations. Thus, we must solve the system 


2x = 21x + à2 
2y = 21y + %2 
2z = —2)ız — à2 . 
ve+ty—2=0 
x+ty-z=-2 


Eliminating i, from the first two equations yields 
Ag = 2x —2)\x = 2y — 2Ayy, 
which implies that 
2(x — y)(l — A1) = 0. 
Therefore, either 
x=y or =l. 


The condition 4; = | implies immediately A2 = 0, and the third equation of the 
system becomes 2z = —2z, so z must equal 0. If z = 0, then x and y must be 
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Figure 4.30 The 
point a, is the point 
on the hyperbola 


closest to the origin. 


The point ap is the 
point on the lower 
branch of the 
hyperbola closest 
to the origin. 


zero by the fourth equation. However, (0, 0, 0) is not a point on the plane z = 
x + y + 2. Thus, the condition A; = 1 leads to no critical point. On the other hand, 
if x = y, then the constraint equations (the last two in the original system of five) 
become 


2x? z7=0 
22S 2 
Substituting z = 2x + 2 yields 
2x? — (2x +2} =0, 
equivalent to 
2x? +8x+4=0, 


whose solutions are x = —2 + J/2. Therefore, there are two constrained critical 
points 


ai = (2+ v2, 2+ v2, 2+2V2) 


and 


a = (-2 J2, -2-2,2 2v2). 
We can check that 
f(a) = 24 — 16/2, f (ar) = 24 + 162, 


so it seems that a; must be the point on C lying nearest the origin, and ay must be 
the point that lies farthest. However, we don’t know a priori if there is a farthest 
point. If the conic section C is a hyperbola or a parabola, then there is no point 
that is farthest from the origin. To understand what kind of curve C is, note 
that a, has positive z-coordinate and a has negative z-coordinate. Therefore, the 
plane z = x + y + 2 intersects both nappes of the cone z? = x? + y?. The only 
conic section that intersects both nappes of a cone is a hyperbola. Hence, C is a 
hyperbola, and we see that the point a; is indeed the point nearest the origin, but 
the point az is not the farthest point. Instead, az is the point nearest the origin on 
the branch of the hyperbola not containing a. That is, local constrained minima 
occur at both a; and a), but only a; is the site of the global minimum. (See 
Figure 4.30.) + 


A Hessian Criterion for Constrained Extrema (optional) — 


As Example 5 indicates, it is often possible to determine the nature of a critical 
point (constrained or unconstrained) from considerations particular to the prob- 
lem at hand. Sometimes this is not difficult to do in practice and can provide 
useful insight into the problem. Nonetheless, occasionally it is advantageous to 
have a more automatic means of discerning the nature of a constrained critical 
point. We therefore present a Hessian criterion for constrained critical points. 
Like the one in the unconstrained case, this criterion only determines the local 
nature of a critical point. It does not provide information about global constrained 
extrema.” 


2 We invite the reader to consult D. Spring, Amer. Math. Monthly, 92 (1985), no. 9, 631—643 for a more 
complete discussion. 
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In general, the context for the Hessian criterion is this: We seek extrema of a 
function f: X C R” — R subject to the k constraints 


Eii Ayee Xn) = C 

82(X1,X2,---,Xn) = C2 

EKI, X2, -05 Xn) = Ck 
We assume that f, g1,..., gą are all of class C 2 and assume, for simplicity, that 
f and the g;’s all have the same domain X. Finally, we assume that V g1, . . . , V 8% 


are linearly independent at the constrained critical point a. Then, by Theorem 3.2, 
any constrained extremum a must satisfy 


V f(a) = A1V gi(a) + à2Vg2(a) +++ + ARV g(a) 


for some scalars 4;,..., Az. We can consider a constrained critical point to be a 
pair of vectors 


(A; a) = (Ài, . a gulp d é ..5n) 
satisfying the aforementioned equation. In fact, we can check that (A; a) is an 


unconstrained critical point of the so-called Lagrangian function L defined 
by 


k 
Lh, ...5 les 41, --+5%n) = f(X1,---, Xn) — X bleie, ..+5Xn) — Ci). 
i=l 


The Hessian criterion comes from considering the Hessian of L at the critical 
point (A; a). Before we give the criterion, we note the following fact from linear 


algebra: Since Vg;(a), ..., Vg,(a) are assumed to be linearly independent, the 
derivative matrix of g = (81, ..., gy) ata, 

O81 O81 

m a E 

TA ) an, (a) 

Dg(a) = : ae ; 3 

98k I8k 

——— a eee ——— 

Jx, ) Jx, (a) 


hasak x k submatrix (obtained by deletingn — k columns of Dg(a)) with nonzero 
determinant. By relabeling the variables if necessary, we will assume that 


O81 Og) 
| w z ea | 
det : E : #0 


dgk dgk 
Jx (a) axe (a) 


(i.e., that we may delete the /ast n — k columns). 
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Second derivative test for constrained local extrema. Given a constrained 
critical point a of f subject to the conditions g)(x) = c1, 82(X) =c,..., 
2k(X) = Cx, consider the matrix 


O21 _ 981 


0 shire 0 ee (ley iene a 
ax ) aa ) 
OE ð 
0 T 0 = ata) SL 
HL(;a) = : Ox) OXn 
§1 &k 
—— —— (a h hin 
n a A 
81 dgk | 
LaL S eels hn ae hnn 
| T. (a) ae (a) 1 
where 
Pf dg) 0° go 0° gx 
i a a a)—----i a). 
7 TE ) T ) a ) Tape ) 


(Note that H L(A; a) is an (n + k) x (n +k) matrix.) By relabeling the vari- 
ables as necessary, assume that 


81 O81 
ee ee 
det : Ze == 0. 
O8k OLk 
are ARE To 


As in the unconstrained case, let H; be the upper leftmost j x j subma- 
trix of HL(A, a). For j = 1,2,...,k nera = det H;, and calculate the 
following sequence of n — k numbers: 

Cl oe) oss Y ee (1) 


Note that, if k > 1, the sequence in (1) is not the complete sequence of prin- 
cipal minors of HL(A, a). Assume dk, = det HL(A, a) Æ 0. The numerical 
test is as follows: 


1. If the sequence in (1) consists entirely of positive numbers, then f has a 
local minimum at a subject to the constraints 
SCX) = Ci, RA) Sc... gA) = Ck. 
2. Ifthe sequence in (1) begins with a negative number and thereafter alter- 
nates in sign, then f has a local maximum at a subject to the constraints 
SCX) =c, 82(X) = C2,..., A) = Ck. 


3. If neither case | nor case 2 holds, then f has a constrained saddle point 
at a. 


In the event that det H L(A, a) = 0, the constrained critical point a is degen- 
erate, and we must use another method to determine whether or not it is the 
site of an extremum. 
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Finally, in the case of no constraint equations g;(x) = c; (i.e., k = 0), the 
preceding criterion becomes the usual Hessian test for a function f ofn variables. 
EXAMPLE 6 In Example 1, we found the minimum of the area function 

A(x, y, Zz) = 2xy + 2yz + xz 
of an open rectangular box subject to the condition 
V(x, y, z) = xyz = 4. 


Using Lagrange multipliers, we found that the only constrained critical point was 
(2, 1, 2). The value of the multiplier à corresponding to this point is 2. To use the 
Hessian criterion to check that (2, 1, 2) really does yield a local minimum, we 
construct the Lagrangian function 


L(x, y, z) = A(x, y, z) — UV, y, z) — 4) 
= 2xy+2yz+xz-—I(xyz — 4). 


Then 
0 —yz —XZ —xy 
eae Ee ee ve a 
—xy l—ly 2-Ix 0 
At the constrained critical point (2; 2, 1, 2), we have 
0 —2 —4 -2 
HL(2;2,1,2)=| 4 s 7 > 
—2 -1 —2 0 
The sequence of determinants to consider is 
0 —-2 —4 
(—1)' det hazı = —det}] —2 0 —2 | =32, 
—4 —2 0 
0 -2 —4 -2 
(—1)! det Hy = — det i E E E = 48. 
—2 -1 —2 0 


Since these numbers are both positive, we see that (2, 1, 2) indeed minimizes the 
area of the box subject to the constant volume constraint. + 


EXAMPLE 7 In Example 5, we found points on the conic section C defined 
by equations 
gia, y, z) =x +y -z = 0 
g(x, y, z) =x +y-z= -2 
that are (constrained) critical points of the “distance” function 
fx, y, z) =x +y +z. 


To apply the Hessian criterion in this case, we construct the Lagrangian function 


L(l,m3x, y, z) =x? y? Hz? I(x? y? z?) m(x+y—z+2). 
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The critical points of L, found by setting DL(/, m; x, y, z) equal to 0, are 


(à1;a1) = (3 + 2V2, —24 + 164/2 ; —2 + 4/2, —2 + V2, —2 + 2/2) 


and 
(à2;a2) = (—3 — 2V2, —24 — 164/2 ; —2 — V2, —2 — V2, — 2 — 24/2). 
The Hessian of L is 
0 0 —2x —2y 2z 
0 0 —1 —1 1 
H L(l, m; x, y, z) = 2x —1 2-2! 0 0 


—2y il 0 2—21 0 
2z 1 0 0 2421 


After we evaluate this matrix at each of the critical points, we need to compute 


(—1)* det Ha(2)41 = det Hs. 


We leave it to you to check that for (A;;.a1) this determinant is 128 — 64/2 ~ 
37.49, and for (Az; a2) it is 128 + 64/2 © 218.51. Since both numbers are pos- 
itive, the points (—2 + /2, —2 + J/2, —2 +22) are both sites of local min- 
ima. By comparing the values of f at these two points, we see that (—2 + V2, 
—2 + /2, —2 + 2./2) must be the global minimum. + 


4.3 Exercises 


1. In this problem, find the point on the plane 2x — 3y — 
z = 4 that is closest to the origin in two ways: 
(a) by using the methods in §4.2 (i.e., by finding the 
minimum value of an appropriate function of two 
variables); 


(b) by using a Lagrange multiplier. 


In Exercises 2-12, use Lagrange multipliers to identify the crit- 
ical points of f subject to the given constraints. 


2. f(x, y)=y, 2x? +y =4 

3. f(x, y)=5x+2y, 5x? +2y = 14 

4. f(x, y)=xy, 2x—3y=6 

5. fŒ, y,z)=xyz, 2x +3y+z=6 

6. f(x,y,z) =x +y +2, xty-z=1 

7. f(x,y, 2=3—-x2-2y?— 2, 2x+y+z=2 
8. fix, y, z) =x +y +2, x +y +z =6 

9. f(x,y,z =2x +y- z, x—2y=0,x+z=0 


10. f(x,y,z) =2x +y? +2z, x? -y =l, x+y+ 
z=2 


11. f(x,y,z) =xy +yz, x +y =l, yz=1 


12. fix,y,ja=xtytz yrl, x4+2z2=1 


13. (a) Find the critical points of f(x, y) = x? + y sub- 
ject to x? + 2y? = 1. 
(b) Use the Hessian criterion to determine the nature 
of the critical point. 


14. (a) Find any critical points of f(x, y, z, w) = x? + 
y? +z? + w? subject to 2x + y+z=1,x — 2z — 
w=-2,3x+y+2w=-l. 

(b) Use the Hessian criterion to determine the nature 
of the critical point. (Note: You may wish to use a 
computer algebra system for the calculations.) 


Just as sometimes is the case when finding ordinary (i.e., un- 
constrained) critical points of functions, it can be difficult to 
solve a Lagrange multiplier problem because the system of 
equations that results may be prohibitively difficult to solve by 
hand. In Exercises 15-19, use a computer algebra system to 
find the critical points of the given function f subject to the 
constraints indicated. (Note: You may find it helpful to provide 
numerical approximations in some cases.) 


@ 15. SQ, y, Z) = 3xy — 4z,3x + y—2xz=1 


Q 16. f(x,y, 2) = 3xy —4yzt5xz, 3x +y +22 = 12, 
2x —3y+5z2=0 


Q 17. 
@ 18. 
@ 19. 


20. 


f(x, y, z) = y? + 2xyz—3x* x7 +y +2 =1 


fa y z) =x +y -az xy +z =1 


fyz w) =x Hy, 
x+y+z+w=l,x—-y+z-w=0 


Consider the problem of determining the extreme val- 

ues of the function f(x, y) = x? + 3y? subject to the 

constraint that xy = —4. 

(a) Usea Lagrange multiplier to find the critical points 
of f that satisfy the constraint. 

(b) Give an analytic argument to determine if the criti- 
cal points you found in part (a) yield (constrained) 
maxima or minima of f. 


Q (c) Use a computer to plot, on a single set of axes, sev- 


21. 


22. 


23. 


24. 


25. 


eral level curves of f together with the constraint 
curve xy = —4. Use your plot to give a geometric 
justification for your answers in parts (a) and (b). 


Find three positive numbers whose sum is 18 and 
whose product is as large as possible. 


Find the maximum and minimum values of 
f(x,y,z) =x + y — z on the sphere x? + y? +z? = 
81. Explain how you know that there must be both a 
maximum and a minimum attained. 


Find the maximum and minimum values of f(x, y) = 
x? + xy + y? onthe closed disk D = {(x, y) | x? + y? 
<4). 


You are sending a birthday present to your calculus in- 
structor. Fly-By-Night Delivery Service insists that any 
package it ships be such that the sum of the length plus 
the girth be at most 108 in. (The girth is the perimeter of 
the cross section perpendicular to the length axis—see 
Figure 4.31.) What are the dimensions of the largest 
present you can send? 


Figure 4.31 Diagram for 
Exercise 24. 


A cylindrical metal can is to be manufactured from 
a fixed amount of sheet metal. Use the method of 
Lagrange multipliers to determine the ratio between 
the dimensions of the can with the largest capacity. 


26. 


27. 


28. 


29. 


30. 
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An industrious farmer is designing a silo to hold her 
9007 ft? supply of grain. The silo is to be cylindrical 
in shape with a hemispherical roof. (See Figure 4.32.) 
Suppose that it costs five times as much (per square 
foot of sheet metal used) to fashion the roof of the silo 
as it does to make the circular floor and twice as much 
to make the cylindrical walls as the floor. If you were 
to act as consultant for this project, what dimensions 
would you recommend so that the total cost would be 
a minimum? On what do you base your recommen- 
dation? (Assume that the entire silo can be filled with 
grain.) 


Figure 4.32 The grain silo 
of Exercise 26. 


You are in charge of erecting a space probe on the 
newly discovered planet Nilrebo. To minimize interfer- 
ence to the probe’s sensors, you must place the probe 
where the magnetic field of the planet is weakest. Nil- 
rebo is perfectly spherical with a radius of 3 (where 
the units are thousands of miles). Based on a coordi- 
nate system whose origin is at the center of Nilrebo, 
the strength of the magnetic field in space is given by 
the function M(x, y, z) = xz — y? + 3x +3. Where 
should you locate the probe? 


Heron’s formula for the area of a triangle whose sides 
have lengths x, y, and z is 


Area = y's(s x\(s — y)(s — z), 


where s = 5(x +y+z) is the so-called semi- 
perimeter of the triangle. Use Heron’s formula to show 
that, for a fixed perimeter P, the triangle with the 
largest area is equilateral. 


Use a Lagrange multiplier to find the largest sphere 
centered at the origin that can be inscribed in the ellip- 
soid 3x? + 2y? + z? = 6. (Be careful with this prob- 
lem; drawing a picture may help.) 


Find the point closest to the origin and on the line 
of intersection of the planes 2x + y+3z=9 and 
3x+2y+z=6. 


ave 


31 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 
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. Find the point closest to the point (2, 5, — 1) and on the 
line of intersection of the planes x — 2y + 3z = 8 and 
22> y= 3. 


The plane x + y+ z= 4 intersects the paraboloid 
z= x? + y? in an ellipse. Find the points on the el- 
lipse nearest to and farthest from the origin. 


Find the highest and lowest points on the ellipse ob- 
tained by intersecting the paraboloid z = x? + y? with 
the plane x + y+ 2z = 2. 


Find the minimum distance between a point on the 
ellipse x? + 2y? = 1 andapoint onthe line x + y = 4. 
(Hint: Consider a point (x, y) on the ellipse and a point 
(u, v) on the line. Minimize the square of the distance 
between them as a function of four variables. This prob- 
lem is difficult to solve without a computer.) 


(a) Use the method of Lagrange multipliers to find crit- 
ical points of the function f(x, y) = x + y subject 
to the constraint xy = 6. 


(b) Explain geometrically why f has no extrema on 
the set {(x, y) | xy = 6}. 


Leta, 8, and y denote the (interior) angles ofa triangle. 
Determine the maximum value of sing sin £ sin y. 


Let S be a surface in R? given by the equation 
g(x, y, z) = c, where g is a function of class C! with 
nonvanishing gradient and c is a constant. Suppose that 
there is a point P on S whose distance from the origin 
is amaximum. Show that the displacement vector from 
the origin to P must be perpendicular to S. 


The cylinder x? + y? = 4 and the plane 2x + 2y + 
z = 2 intersect in an ellipse. Find the points on 
the ellipse that are nearest to and farthest from the 
origin. 

Find the points on the ellipse 3x? — 4xy + 3y? = 50 
that are nearest to and farthest from the origin. 


This problem concerns the determination of the ex- 
trema of f(x, y) = /x + 8,/y subject to the con- 
straint x? + y? = 17, where x > 0 and y > 0. 

(a) Explain why f must attain both a global minimum 
and a global maximum on the given constraint 
curve. 

(b) Use a Lagrange multiplier to solve the system of 
equations 


, 


ee y) =AVa(x, y) 
g(x, y)=0 


where g(x, y) = x? + y?. You should identify a 
single critical point of f. 

(c) Identify the global minimum and the global max- 
imum of f subject to the constraint. 


41. 


Consider the problem of finding extrema of f(x, y) = 

x subject to the constraint y? — 4x7 + 4x4 = 0. 

(a) Use a Lagrange multiplier and solve the system of 
equations 


’ 


pee y) =AVa(x, y) 
g(x,y) =0 
where g(x, y) = y? — 4x3 + 4x4. By doing so, 


you will identify critical points of f subject to the 
given constraint. 


Q (b) Graph the curve y? — 4x? + 4x4 = 0 and use the 


graph to determine where the extrema of f (x, y) = 
x occur. 


(c) Compare your result in part (a) with what you 
found in part (b). What accounts for any differ- 
ences that you observed? 


42. Consider the problem of finding extrema of 


43. 


f(x, y, z) = x? + y? subject to the constraint z = c, 

where c is any constant. 

(a) Use the method of Lagrange multipliers to iden- 
tify the critical points of f subject to the constraint 
given above. 


(b) Using the usual alphabetical ordering of variables 
(i.e., x1 =x, X2 = y, X3 = Z), construct the Hessian 
matrix H L(à;a1, a2, a3) (where L(l; x,y,z) = 
f(x, y,z)— I(z — c)) for each critical point you 
found in part (a). Try to use the second deriva- 
tive test for constrained extrema to determine the 
nature of the critical points you found in part (a). 
What happens? 


(c) Repeat part (b), this time using the variable order- 
ing xı = Z, X2 = y, x3 = x. What does the second 
derivative test tell you now? 

(d) Without making any detailed calculations, discuss 
why f must attain its minimum value at the point 
(0, 0, c). Then try to reconcile your results in parts 
(b) and (c). This exercise demonstrates that the as- 
sumption that 


981 agi 
Ix (a) > Ox, (a) 
i ea a 
Ik dgk 
ax? an? 


is important. 


Consider the problem of finding critical points of the 
function f(x1,...,Xn) subject to the set of k con- 


straints 
Bi(X1,---5 Xn) = C1, Bal Cires s Ly) HC, wos 
R(X, tee Xe) = Ck. 


Assume that f, g1, @2,..., g are all of class C?. 


Height 


(a) Show that we can relate the method of Lagrange 
multipliers for determining constrained critical 
points to the techniques in §4.2 for finding un- 
constrained critical points as follows: If 

(A,a)=(1,.. 


ss ASL, ++ +5 Mn) 


is a pair consisting of k values for Lagrange mul- 
tipliers A,,..., A, and n values a), ..., an for the 
variables x1, ...,X„ such that a is a constrained 
critical point, then (A, a) is an ordinary (i.e., un- 
constrained) critical point of the function 

Lise: 


salk; Xi seyn) 


k 
= f (Xi, -3 Xn) — POLE, +, Xn) — ci). 
man] 


(b) Calculate the Hessian H L(A, a), and verify that it 
is the matrix used in §4.3 to provide the criterion 
for determining the nature of constrained critical 
points. 


44. 


45. 
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The unit hypersphere in R” (centered at the origin 
0 = (0, ... , 0)) is defined by the equation Xi + x + 
EE x = 1. Find the pair of points x = (x,..., Xn) 
and y = (y1,..-, Yn), each of which lies on the unit 
hypersphere, that maximizes and minimizes the func- 
tion 


n 
IR ethic Ya) = J kiyi. 
i=1 


What are the maximum and minimum values of f? 


Let x = (x1, .. ., Xn) andy = (),..., Yn) be any vec- 
tors in R” and, fori = 1,..., n, set 
a y; 
u; = ——— and v = ———.. 
2 / 2 
ae ae Yi 


(a) Show that u = (u1, ..., Un) and v = (v,.. 
lie on the unit hypersphere in R”. 

(b) Use the result of Exercise 44 to establish the 
Cauchy—Schwarz inequality 


«5 Un) 


x+y] < Ixl] Ilyll. 


4.4 Some Applications of Extrema 


In this section, we present several applications of the methods for finding both 
constrained and unconstrained extrema discussed previously. 


Least Squares Approximation 


° The simplest relation between two quantities x and y is, without doubt, a linear 


one: y = mx + b (where m and b are constants). When a biologist, chemist, 


° psychologist, or economist postulates the most direct connection between two 


types of observed data, that connection is assumed to be linear. Suppose that Bob 
š Biologist and Carol Chemist have measured certain blood protein levels in an 


adult population and have graphed these levels versus the heights of the subjects 


Figure 4.33 Height versus protein 


level. 


Height 


Protein level 


as in Figure 4.33. If Prof. Biologist and Dr. Chemist assume a linear relationship 
between the protein and height, then they desire to pass a line through the data as 
closely as possible, as suggested by Figure 4.34. 

To make this standard empirical method of linear regression precise (in- 


stead of merely graphical and intuitive), we first need some notation. Suppose we 


Figure 4.34 Fitting a line to 


Protein level 


the data. 


D(m, b) = [yı 


have collected n pairs of data (x1, y1), (x2, y2), ---, (Xn, Yn). (In the example just 
described, x; is the protein level of the ith subject and y; his or her height.) We 
assume that there is some underlying relationship of the form y = mx + b, and 
we want to find the constants m and b so that the line fits the data as accurately 
as possible. Normally, we use the method of least squares. The idea is to find 
the values of m and b that minimize the sum of the squares of the differences 
between the observed y-values and those predicted by the linear formula. That is, 
we minimize the quantity 


(mx, + b)P + [y2 — (mx2 + b)? 


H [Yn (MXn H b), (1) 
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Line y=mx+b 


mx; + b+ |_— Distance is 
|y: — (mx; + b)| 


(x; yi) 


Figure 4.35 The method of least squares. 


where, fori = 1,...,”, y; represents the observed y-value of the data, and 
mx; + b represents the y-value predicted by the linear relationship. Hence, each 
expression in D of the form y; — (mx; + b) represents the error between the 
observed and predicted y-values. (See Figure 4.35.) They are squared in the 
expression for D in order to avoid the possibility of having large negative 
and positive terms cancel one another, thereby leaving little or no “net error,” 
which would be misleading. Moreover, D(m, b) is the square of the distance 
in R” between the point (y1, y2,..., Yn) and the point (mx; + b, mx. +b,..., 
mMXn + b). 

Thus, we have an ordinary minimization problem at hand. To solve it, we 
need to find the critical points of D. First, we can rewrite D as 


D(m, b) = X [yi — (mx; + b)? 
i=l 


= 9 — 2m a = oy yi + Cr +b’. 
i=l i=l i=l i=l 


Then 
z E = XiYi + 3 2(mx; + b)xi 
Z By ns í mY + 2b x 
and E = i=l 


2(mx; + b) 
1 


dD Z 
3b = 2 Ta 


=-2 5 yi + 2m Sa +2nb. 
i=l i 


i=] 


n 


When we set both partial derivatives equal to zero, we obtain the following pair 
of equations, which have been simplified slightly: 


(2x7) m+ (X xi)b = X xiyi 
($ xi)m +nb => yi . 


(All sums are taken from i = 1 to n.) Although (2) may look complicated, it is 
nothing more than a linear system of two equations in the two unknowns m and b. 


(2) 
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It is not difficult to see that system (2) has a single solution. Therefore, we have 
shown the following: 


PROPOSITION 4.1 Given n data points (x1, y1), (x2, Y2), ---; (Xn, Yn) With not 
all of x1, x2,...,X, equal, the function 


n 


D(m, b) = $ [yi — (mx; + BYP 


i=l 

has a single critical point (mo, bo) given by 

= n D XiYi — D9 aCe yi) 
n> x? — © xi)” 


and 
EAE- EE) 
ný x? — (> xi)” 


by = 


Since D(m, b) is a quadratic polynomial in m and b, the graph ofz = D(m, b) 
is a quadric surface. (See $2.1.) The only such surfaces that are graphs of functions 
are paraboloids and hyperbolic paraboloids. We show that, in the present case, 
the graph is that of a paraboloid by demonstrating that D has a local minimum at 
the critical point (mo, bo) given in Proposition 4.1. 

We can use the Hessian criterion to check that D has a local minimum at 
(mo, bo). We have 


ya 2% 


H D(m, b) = 
2f xi 2n 


The principal minors are 2 X x? and 4n © x? — 4X} x). The first minor is 
obviously positive, but determining the sign of the second requires a bit more 
algebra. (If you wish, you can omit reading the details of this next calculation 
and rest assured that the story has a happy ending.) Ignoring the factor of 4, we 


examine the expression n } x? — ($ xj). Expanding the second term yields 


2 
n n n n 
ny #-() x) =n) #-() a+) 2an) 
i=l i=l i=l i=l 


i<j 
=(n-1)> x -9 2x;x;. (3) 
i=l i<j 
On the other hand, we have 
2a -xf = » (x? — 2xixj +x?) = (n — 1) Sox? — eee (4) 
i<j i<j i=l i<j 
(To see that equation (4) holds, you need to convince yourself that 


D> (x? +.x3) =(n— DÝ? 
i=l 


i<j 


296 Chapter 4 | Maxima and Minima in Several Variables 


Best fit 
line 


Figure 4.36 Data for the linear 
regression of Example 1. 


by counting the number of times a particular term of the form Xe appears in the 
left-hand sum.) Thus, we have 


2 
det H D(m, b) = 4 | nY x? — (>>) 
i=l i=l 
= a(o — 1) ee — = >i) by equation (3), 
i=l 


i<j 


=4 Gi —x y by equation (4). 


i<j 


Because this last expression is a sum of squares, it is nonnegative. Therefore, the 
Hessian criterion shows that D does indeed have a local minimum at the critical 
point. Hence, the graph of z = D(m, b) is that of a paraboloid. Since the (unique) 
local minimum of a paraboloid is in fact a global minimum (consider a typical 
graph), we see that D is indeed minimized at (mọ, bo). 


EXAMPLE 1 To see how the preceding discussion applies to a specific set of 
data, consider the situation depicted in Figure 4.36. 
We have n = 5, and the function D to be minimized is 


D(m, b) = [2 — (m + b) + [1 — (2m + b)P + [5 — Bm + b)P 
+ [3 — (4m + b)? + [4 — (5m + bY}. 


We compute 


Jasi a = yas, oxy: = 51. 


Thus, using Proposition 4.1, 


5-51— 15-15 3 55-15- 15.51 6 
OS paea ~ 5.55—15.15 5 
The best fit line in terms of least squares approximation is 
y= PT + 
5 5 


Of course, linear regression is not always an appropriate technique. It may 
not be reasonable to assume that the data points fall nearly on a straight line. 
Some formula other than y = mx + b may have to be assumed to describe the 
data with any accuracy. Such a postulated relation might be quadratic, 


y =ax° +bx +c, 


or x and y might be inversely related, 
a 
y= — +b. 
x 


You can still apply the method of least squares to construct a function analogous 

to D in equation (1) to find the relation of a given form that best fits the data. 
Another way that least squares arise is if y depends not on one variable but 

on several: x1, X2,..., Xn. For example, perhaps adult height is measured against 


Particle 
x 


| 


Figure 4.37 A particle traveling 
ina force field F. 
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blood levels of 10 different proteins instead of just one. Multiple regression is 
the statistical method of finding the linear function 

Y= axı + ax + +++ + anXn +b 
that best fits a data set of (n + 1)-tuples 


a) .@) 1 (2) 2) 2 (k) k 
lo s X2 Paa a yhe a gaas xP, Pi) aa a- (x ae ai x, yf. 


We can find such a “best fit hyperplane” by minimizing the sum of the squares of 
the differences between the y-values furnished by the data set and those predicted 
by the linear formula. We leave the details to you.? 


Physical Equilibria 
Let F: X C R? —> R? be a continuous force field acting on a particle that moves 


along a path x: Z C R —> R? as in Figure 4.37. Newton’s second law of motion 
states that 


FQ) = mx"(), (5) 


where m is the mass of the particle. For the remainder of this discussion, we will 
assume that F is a gradient field, that is, that F = —VV for some C! potential 
function V: X C R? —> R. (See §3.3 fora brief comment about the negative sign.) 
We first establish the law of conservation of energy. 


THEOREM 4.2 (CONSERVATION OF ENERGY) Given the set-up above, the 
quantity 


Sml|x'(t)|° + VEA) 


is constant. 


The term im ||x’(t)||? is usually referred to as the kinetic energy of the particle 
and the term V(x(f)) as the potential energy. The significance of Theorem 4.2 
is that it states that the sum of the kinetic and potential energies of a particle 
is always fixed (conserved) when the particle travels along a path in a gradient 
vector field. For this reason, gradient vector fields are also called conservative 
vector fields. 


Proof of Theorem 4.2 As usual, we show that the total energy is constant by 
showing that its derivative is zero. Thus, using the product rule and the chain rule, 
we calculate 


< [imo x(t) + Vex) | = mx" (t) - x(t) + VV (x(t) -x(t 
= mx" (t) x (t) — F(x(1) x(t) 
= mx" (t)- x’ (t) — mx" (t). x'(t) 
=0, 
from the definitions of F and V and by formula (5). E 
3 Or you might consult S. Weisberg, Applied Linear Regression, 2nd ed., Wiley-Interscience, 1985, 


Chapter 2. Be forewarned, however, that to treat multiple regression with any elegance requires somewhat 
more linear algebra than we have presented. 
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Figure 4.38 For a stable 
equilibrium point, the path of a 
nearby particle with a sufficiently 
small kinetic energy will remain 
nearby with a bounded kinetic 


energy. 


In physical applications it is important to identify those points in space that 
are “rest positions” for particles moving under the influence of a force field. These 
positions, known as equilibrium points, are such that the force field does not act 
on the particle so as to move it from that position. Equilibrium points are of two 
kinds: stable equilibria, namely, equilibrium points such that a particle perturbed 
slightly from these positions tends to remain nearby (for example, a pendulum 
hanging down at rest) and unstable equilibria, such as the act of balancing a ball 
on your nose. The precise definition is somewhat technical. 


DEFINITION 4.3 Let F: X C R” — R” be any force field. Then xp € X is 
called an equilibrium point of F if F(xọ) = 0. An equilibrium point xo is 
said to be stable if, for every r, € > 0, we can find other numbers ro, €o > 0 
such that if we place a particle at position x with ||x — xo|| < rọ and provide it 
with a kinetic energy less than €ọ, then the particle will always remain within 
distance r of Xp with kinetic energy less than e€. 


In other words, a stable equilibrium point xq has the following property: You 
can keep a particle inside a specific ball centered at xp with a small kinetic energy 
by starting the particle inside some other (possibly smaller) ball about xp and 
imparting to it some (possibly smaller) initial kinetic energy. (See Figure 4.38.) 


THEOREM 4.4 Fora C! potential function V of a vector field F = —VV, 
1. The critical points of the potential function are precisely the equilibrium 
points of F. 


2. If Xo gives a strict local minimum of V, then Xo is a stable equilibrium point 
of F. 


EXAMPLE 2 The vector field F = (—6x — 2y — 2)i + (—2x — 4y + 2)j is 
conservative and has 


V(x, y) = 3x? + 2xy + 2x + 2y? —2y4+4 


as a potential function (meaning that F = —VV, according to our current sign 
convention). There is only one equilibrium point, namely, (—ż, +). To see if it is 
stable, we look at the Hessian of V: 


6 2 
avi-2g=[$ ih 
3 4 


The sequence of principal minors is 6, 20. By the Hessian criterion, (—2, $) is a 
strict local minimum of V and, by Theorem 4.4, it must be a stable equilibrium 


point of F. + 


Proof of Theorem 4.4 The proof of part 1 is straightforward. Since F = —VV, 
we see that F(x) = 0 if and only if VV (x) = 0. Thus, equilibrium points of F are 
the critical points of V. 

To prove part 2, let xo be a strict local minimum of V and x: J > R” aC! 
path such that x(f9) = Xo for some tọ € I. By conservation of energy, we must 
have, for all t € J, that 


im IX O + VE) = imix toll? + Vx). 


Figure 4.39 On the surface 

S = {x | g(x) = c}, the component 
of F that is tangent to S at x is 
denoted by ®(x). 
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To show that xo is a stable equilibrium point, we desire to show that we can bound 
the distance between x(t) and xo = x(t) by any amount r and the kinetic energy 
by any amount e. That is, we want to show we can achieve 


I|x(t) — xoll < r 
(i.e., x(t) € B,(Xo) in the notation of §2.2) and 
tmx Ol? < e. 


As the particle moves along x away from Xo, the potential energy must increase 
(since Xo is assumed to be a strict local minimum of potential energy), so the 
kinetic energy must decrease by the same amount. For the particle to escape from 
B,(Xo), the potential energy must increase by a certain amount. If €ọ is chosen to 
be smaller than that amount, then the kinetic energy cannot decrease sufficiently 
(so that the conservation equation holds) without becoming negative. This being 
clearly impossible, the particle cannot escape from B,(xo). | 


Often a particle is not only acted on by a force field but also constrained to lie 
in a surface in space. The set-up is as follows: F is a continuous vector field on R? 
acting on a particle that lies in the surface S = {x € R? | g(x) = c}, where g isa 
C! function such that V g(x) Æ 0 for all x in S. Most of the comments made in the 
unconstrained case still hold true, provided F is replaced by the vector component 
of F tangent to S. Since, atx € S, V g(x) is normal to S, this tangential component 
of F at x is 


(x) = F(X) — projvgw FA). (6) 
(See Figure 4.39.) Then in place of formula (5), we have, for a path x: Z C R > S, 
SAH) = mx" (t). (7) 


We can now state a “constrained version” of Theorem 4.4. 


THEOREM 4.5 Fora C! potential function V of a vector field F = —VV, 


1. If V|s has an extremum at xp € S, then xo is an equilibrium point in S. 


2. If V|s has a strict local minimum at xg € S, then xo is a stable equilibrium 
point. 


Sketch of proof For part 1, if V |s has an extremum at xo, then, by Theorem 3.1, 
we have, for some scalar A, that 


VV (Xo) = AVg(Xo). 
Hence, because F = —VV, 
F(x) = —AV g(Xo), 


implying that F is normal to S at xo. Thus, there can be no component of F tangent 
to S at xq (i.e., ®(xo) = 0). Since the particle is constrained to lie in S, we see 
that the particle is in equilibrium in S. 

The proof of part 2 is essentially the same as the proof of part 2 of Theorem 
4.4. The main modification is that the conservation of energy formula in Theorem 
4.2 must be established anew, as its derivation rests on formula (5), which has 
been replaced by formula (7). Consequently, using the product and chain rules, 
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X +y+(z-2r} =r? 


x 


Figure 4.40 On the sphere 
x? 4+y+(¢—2ry =r’, the 
points (0, 0, r) and (0, 0, 3r) 
are equilibrium points for the 
gravitational force field 

F = —megk. 


we check, for x: / > S, 


d d 
< | dmx (Ol? + VEO] = =| dmx’) -X') + VO) | 


= mx" (t) e x' (t) + VV(x(t)) -x (t). 


Then, using formula (6), we have 


d 
L [im (OI? + Va] 


Fy x(t). mx" (t) — F(x(t)) -x‘(t) 


x'(t) + B(x(1)) — FOC) +x) 

x'(t) + [FERE — projygamF(x(1)] 
-FQ x(t) 

= — x'(t) + projyex F(X) 


after cancellation. Thus, we conclude that 


d 
< | mix (i? + Vew] = 0. 


since x’(r) is tangent to the path in S and, hence, tangent to S itself at x(t), while 
PFO} vy o¢x(ry) (X(t) is parallel to V g(x(t)) and, hence, perpendicular to S at x(t). m 


II 


EXAMPLE 3 Near the surface of the earth, the gravitational field is ap- 
proximately 


F = —megk. 


(We’re assuming that, locally, the surface of the earth is represented by the plane 
z = 0.) Note that F = —VV, where 


V(x, y, Z) = mgz. 
Now suppose a particle of mass m lies on a small sphere with equation 
A(x, y,z= x? + y? +(z- 2r}? =r’, 
We can find constrained equilibria for this situation, using a Lagrange multiplier. 
The gradient equation VV = AV/h, along with the constraint, yields the system 
0 = 2Ax 
0= 2ry 
mg = 2X(z — 2r) 
xy? 4+(z—2rP =r? 
Because m and g are nonzero, à cannot be zero. The first two equations imply 
x = y = 0. Therefore, the last equation becomes 
(Z- 2ry =r’, 
which implies 
Z=r,3r 
are the solutions. Consequently, the positions of equilibrium are (0, 0, r) and 
(0, 0, 3r) (corresponding to à = —mg/2r and +mg/2r, respectively). From ge- 


ometric considerations, we see V is strictly minimized at S at (0, 0, 7) and maxi- 
mized at (0, 0, 37) as shown in Figure 4.40. From physical considerations, (0, 0, r) 
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is a stable equilibrium and (0, 0, 3r) is an unstable one. (Try balancing a marble 
on top of a ball.) + 


Applications to Economics 


We present two illustrations of how Lagrange multipliers occur in problems in- 
volving economic models. 


EXAMPLE 4 The usefulness of amounts x1, x2, . . . , Xn Of (respectively) dif- 
ferent capital goods G1, G2, ..., G, can sometimes be measured by a function 
U(x], X2, .. - , Xn), called the utility of these goods. Perhaps the goods are indi- 
vidual electronic components needed in the manufacture of a stereo or computer, 
or perhaps U measures an individual consumer’s utility for different commodi- 
ties available at different prices. If item G; costs a; per unit and if M is the total 
amount of money allocated for the purchase of these n goods, then the consumer 
or the company needs to maximize U (x1, x2,..., Xn) Subject to 


A,X, + 9X2 + ` +t + anXn = M. 


This is a standard constrained optimization problem that can readily be approached 
by using the method of Lagrange multipliers. 

For instance, suppose you have a job ordering stationery supplies for an office. 
The office needs three different types of products a, b, and c, which you will order 
inamounts x, y, and z, respectively. The usefulness of these products to the smooth 
operation of the office turns out to be modeled fairly well by the utility function 
U(x, y,z) =xy + xyz. If product a costs $3 per unit, product b $2 a unit, and 
product c $1 a unit and the budget allows a total expenditure of not more than 
$899, what should you do? The answer should be clear: You need to maximize 


U(x, y,z)=xy+xyz subjectto B(x, y,z)=3x+2y+z= 899. 


The Lagrange multiplier equation, VU(x, y, z) = AV B(x, y, z), and the budget 
constraint yield the system 


ytyz= 3a 
x+xz=2A 
xy=i 


3x +2y+z= 899 
Solving for A in the first three equations yields 


a a i 2 |T 


The last equality implies that either x = 0 or y = (z + 1)/2. We can reject the 
first possibility, since U (0, y, z) = 0 and the utility U(x, y, z) > 0 whenever 
x, y, and z are all positive. Thus, we are left with y = (z + 1)/2. This in turn 
implies that A = (z + 1)?/6. Substituting for y in the constraint equation shows 
that x = (898 — 2z)/3, so that equation xy = 4 becomes 


(EFN 
3 i a g 


which is satisfied by either z = —1 (which we reject) or by z = 299. The only 
realistic critical point for this problem is (100, 150, 299). We leave it to you to 
check that this point is indeed the site of a maximum value for the utility. + 


302 


Chapter 4 | Maxima and Minima in Several Variables 


EXAMPLE 5 In 1928, C. W. Cobb and P. M. Douglas developed a simple model 
for the gross output Q of a company or a nation, indicated by the function 


QO(K, L) = AK*L', 


where K represents the capital investment (in the form of machinery or other 
equipment), L the amount of labor used, and A and a positive constants with 
0 <a <1. (The function Q is known now as the Cobb-Douglas production 
function.) If you are president of a company or nation, you naturally wish to 
maximize output, but equipment and labor cost money and you have a total 
amount of M dollars to invest. If the price of capital is p dollars per unit and 
the cost of labor (in the form of wages) is w dollars per unit, so that you are 
constrained by 


B(K,L) = pK +wL <M, 


what do you do? 

Again, we have a situation ripe for the use of Lagrange multipliers. Before 
we consider the technical formalities, however, we consider a graphical solution. 
Draw the level curves of Q, called isoquants, as in Figure 4.41. Note that Q 
increases as we move away from the origin in the first quadrant. The budget 
constraint means that you can only consider values of K and L that lie inside or 
on the shaded triangle. It is clear that the optimum solution occurs at the point 
(K, L) where the level curve is tangent to the constraint line pK + wL = M. 

Here is the analytical solution: From the equation V Q(K, L) = AV B(K, L) 
plus the constraint, we obtain the system 


AaK*!L!4 = àp 

A(l -—a)K°L™“ =hw. 

pK +wL =M 
Solving for p and w in the first two equations yields 


Aa A(1 — a) 


p= Ea and w= KL=”. 


Increasing Q 


pK+wL=M 


Optimum value 


Figure 4.41 A family of isoquants. The optimum value of Q(K, L) subject to the 
constraint pK + wL = M occurs where a curve of the form Q = c is tangent to the 
constraint line. 
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Substitution of these values into the third equation gives 


A AQ — 
SAKLI + ( a) KIL!” = M. 
Thus, 
A 
J= S Ke 
M 
and the only critical point is 
Ma M(l-—a) 
(K, L) = | —, ——— }. 
p w 


From this geometric discussion, we know that the critical point must yield the 
maximum output Q. 
From the Lagrange multiplier equation, at the optimum values for L and K, 
we have 
r 130 10Q 
~ pəK wal’ 


This relation says that, at the optimum values, the marginal change in output per 
dollar’s worth of extra capital equals the marginal change per dollar’s worth of 
extra labor. In other words, at the optimum values, exchanging labor for capital 
(or vice versa) won’t change the output. This is by no means the case away from 
the optimum values. 

There is not much that is special about the function Q chosen. Most of our 
observations remain true for any C? function Q that satisfies the conditions 


aQ aQ 30 a0 
’ = 0, Sea) ae 
aK OL aK2° ƏL? 
If you consider what these relations mean qualitatively about the behavior of the 
output function with respect to increases in capital and labor, you will see that 


they are entirely reasonable assumptions.* + 
4.4 Exercises 
1. Find the line that best fits the following data: (0, 2), (b) Show that the “best fit” curve of the form y = 
(1, 3), 2, 5), G, 3), (4, 2), (5, 7), (6, 7). a/x + b should have 
2. Show that if you have only two data points (x1, y1) and 
(x2, y2), then the best fit line given by the method of gus n3 lt — (x 1/x:) (x yi) 
least squares is, in fact, the line through (x1, y1) and E 2 
(2, y2). nD Ua? — (Limi) 
3. Suppose that you are given n pairs of data (x1, yı), and 
(x2, Y2),-+-, (Xn, Yn) and you seek to fit a function of 
the form y = a/x + b to these data. (£ 1/3?) (= vi) — (£ 1/x:) (£ vi/xi) 
(a) Use the method of least squares as outlined in this b= 2 : 
section to construct a function D(a, b) that gives ny 1/x? — (= 1/ xi) 
the sum of the squares of the distances between 
observed and predicted y-values of the data. (All sums are from i = 1 ton.) 


* For more about the history and derivation of the Cobb-Douglas function, consult R. Geitz, “The Cobb- 
Douglas production function,’ UMAP Module No. 509, Birkhauser, 1981. 
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4. Find the curve of the form y = a/x +b that best fits 


the following data: (1, 0), (2, —1), (5, 1), and (3, —}). 
(See Exercise 3.) 


. Suppose that you have n pairs of data (x1, yı), 


(x2, y2), +--+; (Xn, Yn) and you desire to fit a quadratic 
function of the form y = ax? + bx + c to the data. 
Show that the “best fit” parabola must have coefficients 
a, b, and c satisfying 


a a e a 
(Caja + (CaP )o+ (Ea) = En 
(Za + (£x)b +ac=} y; 


(All sums are from i = 1 to n.) 


. (Note: This exercise will be facilitated by the use of 


a spreadsheet or computer algebra system.) Egbert 

recorded the number of hours he slept the night before 

a major exam versus the score he earned, as shown in 

the table below. 

(a) Find the line that best fits these data. 

(b) Find the parabola y = ax? + bx + c that best fits 
these data. (See Exercise 5.) 

(c) Last night Egbert slept 6.8 hr. What do your an- 
swers in parts (a) and (b) predict for his score on 
the calculus final he takes today? 


Hours of sleep | Test score 
8 85 
8.5 72 
9 95 
7 68 
4 52 
8.5 75 
7.5 90 
6 65 
Let F = (—2x — 2y — 1)i+ (2x — 6y = 2)j. 
(a) Show that F is conservative and has potential 


function 
V(x, y)= x? + 2xy + 3y? +x+2y 


(ie., F = —V V). 


(b) What are the equilibrium points of F? The stable 
equilibria? 


. Suppose a particle moves in a vector field F in R? with 


physical potential 
V(x, y) = 2x? — 8xy — y? + 12x — 8y + 12. 


Find all equilibrium points of F and indicate which, if 
any, are stable equilibria. 


9. Let a particle move in the vector field F in R? whose 
physical potential is given by 
V(x, y, Z) = 3x? + 2xy + z? — 2yz + 3x + Sy — 10. 
Determine the equilibria of F and identify those that 
are stable. 


10. Suppose that a particle of mass m is constrained to 
move on the ellipsoid 2x? + 3y? + z? = 1 subject to 
both a gravitational force F = —m gk, as well as to an 
additional potential V(x, y, z) = 2x. 

(a) Find any equilibrium points for this situation. 


(b) Are there any stable equilibria? 


11. The Sukolux Vacuum Cleaner Company manufactures 
and sells three types of vacuum cleaners: the standard, 
executive, and deluxe models. The annual revenue in 
dollars as a function of the numbers x, y, and z (re- 
spectively) of standard, executive, and deluxe models 
sold is 


R(x, y, z) = xyz? — 25,000x — 25,000y — 25,000z. 


The manufacturing plant can produce 200,000 total 
units annually. Assuming that everything that is manu- 
factured is sold, how should production be distributed 
among the models so as to maximize the annual 
revenue? 


12. Some simple electronic devices are to be designed to 
include three digital component modules, types 1, 2, 
and 3, which are to be kept in inventory in respective 
amounts x1, x2, and x3. Suppose that the relative im- 
portance of these components to the various devices is 
modeled by the utility function 


U (x1, X2, X3) = x1X2 + 2x1X3 + X1X2X3. 


You are authorized to purchase $90 worth ofthese parts 
to make prototype devices. If type 1 costs $1 per com- 
ponent, type 2 $4 per component, and type 3 $2 per 
component, how should you place your order? 


13. A farmer has determined that her cornfield will yield 
corn (in bushels) according to the formula 


B(x, y) = 4x? + y? + 600, 


where x denotes the amount of water (measured in 
hundreds of gallons) used to irrigate the field and y 
the number of pounds of fertilizer applied to the field. 
The fertilizer costs $10 per pound and water costs $15 
per hundred gallons. If she can allot $500 to prepare 
her field through irrigation and fertilization, use a 
Lagrange multiplier to determine how much water and 
fertilizer she should purchase in order to maximize her 
yield. 


14. A textile manufacturer plans to produce a cashmere/ 
cotton fabric blend for use in making sweaters. The 
amount of fabric that can be produced is given by 


f(x, y) = 4xy — 2x — 8y + 3, 


15. 


True/False Exercises for Chapter 4 


11. 
12. 


where x denotes the number of pounds of raw cash- 

mere used is and y is the number of pounds of raw 

cotton. Cotton costs $2 per pound and cashmere costs 
$8 per pound. 

(a) Ifthe manufacturer can spend $1000 on raw mate- 
rials, use a Lagrange multiplier to advise him how 
he should adjust the ratio of materials in order to 
produce the most cloth. 


(b) Now suppose that the manufacturer has a budget 
of B dollars. What should the ratio of cotton to 
cashmere be (in terms of B)? What is the limiting 
value of this ratio as B increases? 


The CEO of the Wild Widget Company has decided 
to invest $360,000 in his Michigan factory. His eco- 
nomic analysts have noted that the output of this 
factory is modeled by the function Q(K,L)= 
60K '/3L7/3, where K represents the amount (in thou- 


. If f isa function of class C? and P2 denotes the second- 


order Taylor polynomial of f at a, then f(x) © p2(x) 
when x © a. 


. The increment Af ofa function f(x, y) measures the 


change in the z-coordinate of the tangent plane to the 
graph of f. 


. The differential df of a function f(x, y) measures the 


change in the z-coordinate of the tangent plane to the 
graph of f. 


. The second-order Taylor polynomial of f(x, y, z) = 


x? +3xz + y? at (1, —1, 2) is po(x, y, zZ) = x? + 3xz 
+y. 


. The second-order Taylor polynomial of f(x, y) = 


x? + 2xy + y at (0, 0) is po(x, y) = 2xy + y. 


. The second-order Taylor polynomial of f(x, y) = 


x? +2xy + y at (1, —1) is po(x, y) = 2xy + y. 


. Near the point (1,3,5), the function f(x, y,z)= 


3x4 + 2y? + z? is most sensitive to changes in z. 


. The Hessian matrix H f (x1, .. . , Xn) of f has the prop- 
erty that Hf(x1,...,4n)’ = Hf,- Xn). 

. If Vf(ai,..., an) = 0, then f has a local extremum 
ata = (aisses dy). 


. If f is differentiable and has a local extremum at 


a = (a1, . .. , an), then V f(a) = 0. 
The set {(x, y, z) | 4 < x? + y? + z? < 9} is compact. 


The set {(x, y) | 2x — 3y = 1} is compact. 


16. 


13. 


14. 


15. 


16. 
17. 


18. 


19. 


20. 


21. 


22. 


23. 
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sands of dollars) spent on capital equipment and L 

represents the amount (also in thousands of dollars) 

spent on labor. 

(a) How should the CEO allocate the $360,000 
between labor and equipment? 


(b) Check that 0Q/0K =0Q/0L at the optimal 
values for K and L. 


Let Q(K,L) be a production function for a com- 
pany where K and L represent the respective amounts 
spent on capital equipment and labor. Let p denote the 
price of capital equipment per unit and w the cost of 
labor per unit. Show that, subject to a fixed produc- 
tion Q(K, L) =c, the total cost M of production is 
minimized when K and L are such that 


100 100 
~ wal’ 


Any continuous function f(x, y) must attain a global 
maximum on the disk {(x, y) | x? + y? < 1}. 


Any continuous function f(x, y,z) must attain a 
global maximum on the ball {(x, y, z) | (x — 1}? + 
(y+ 1? +2 <4}. 


If f(x, y) is of class C*, has a critical point at (a, b), 
and fix(a, b) fyy(a, b) = f(a, by < 0, then f has a 
saddle point at (a, b). 


If det H f(a) = 0, then f has a saddle point at a. 


The function f(x, y,z) = x?y?z — x?(y +z) has a 
saddle point at (1, —1, 2). 


The function f(x, y, z) = x? + y? + z? — yz has a lo- 
cal maximum at (0, 0, 0). 


The function f(x, y, z) = xy? — x7z + z has a degen- 
erate critical point at (—1, 0, 0). 


The function F (x1, ..., Xn) = 2x1 — 1)? — 3&2 — D? 
++ (D+! (n + IQ, — n’ has a critical point at 
(1,2,...,n). 


The function F(x1, ..., Xn) = 2x1 — 1)? — 3(x2 — D? 
4+---4+(-1)'*(n + D(x, — ny has a minimum at 
(1,2,...,n). 


All local extrema of a function of more than one vari- 
able occur where all partial derivatives simultaneously 
vanish. 


All points a=(qa),...,a)) where the function 
f(xi,..., Xn) has an extremum subject to the con- 
straint that g(x1,..., Xn) =c, are solutions to the 
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a 
Bf es 
Ox, axı 
of. 8 
aoe 28 
OXp OXp 
P(X isa: By) Se 
24. Any solution (A;,..., Ax, X1, - -< , Xn) to the system of 
equations 
af O81 8k 
we HA te te 
Ox, i Ox) i Ox, 
af O81 Ok 
— . À 3 
ax, dx, Ox, 


25. 


Miscellaneous Exercises for Chapter 4 
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system of equations 


Bi(X1,--- Mn = c1 


pree Xn) = Ce 


yields a point (x1, ..., Xn) that is an extreme value 
of f subject to the simultaneous constraints g; = 
Ciis.aa85: Bk = Che 


To find the critical points of the function f(x, y, z, w) 
subject to the simultaneous constraints g(x, y, z, w) = 


Let V = 2r*h, where r © ro and h © ho. What re- 
lationship must hold between rp and ho for V to be 
equally sensitive to small changes in r and h? 


(a) Find the unique critical point of the function 


EAPC) 2 
ATA 


F(%1, X2,-- 


+4 Xp) A 


(b) Use the Hessian criterion to determine the nature 
of this critical point. 


The Java Joint Gourmet Coffee House sells top-of- 
the-line Arabian Mocha and Hawaiian Kona beans. 
If Mocha beans are priced at x dollars per pound 
and Kona beans at y dollars per pound, then mar- 
ket research has shown that each week approximately 
80 — 100x + 40y pounds of Mocha beans will be sold 
and 20 + 60x — 35y pounds of Kona beans will be 
sold. The wholesale cost to the Java Joint owners is 
$2 per pound for Mocha beans and $4 per pound for 
Kona beans. How should the owners price the coffee 
beans in order to maximize their profits? 


. The Crispy Crunchy Cereal Company produces three 


brands, X, Y, and Z, of breakfast cereal. Each month, 
x, y, and z (respectively) 1000-box cases of brands X, 


26. 


27. 


28. 


29. 


30. 


c, h(x, y, z, w) = d, k(x, y, z, w) = e using the tech- 
nique of Lagrange multipliers, one will have to solve 
a system of four equations in four unknowns. 


Suppose that f(x, y, z) and g(x, y, z) are of class C! 
and that (xo, yo, Zo) is a point where f achieves a maxi- 
mum value subject to the constraint that g(x, y, z) = c 
and that Vg(xo, yo, Zo) is nonzero. Then the level set 
of f that contains (xo, yo, Zo) must be tangent to the 
level set S = {(x, y, z) | g(x, y,z) = c}. 


The critical points of f(x, y,z)=xy+2xz+2yz 
subject to the constraint that xyz = 4 are the same 
as the critical points of the function F(x, y) = 
8 8 
SYP Ha es 
x y 


Given data points (3, 1), (4, 10), (5, 8), (6, 12), to find 
the best fit line by regression, we find the minimum 
value of the function D(m,b) = (3m +b — 1} + 
(4m +b — 10) + (5m +b — 8) + (6m + b — 12). 


All equilibrium points of a gradient vector field 
are minimum points of the vector field’s potential 
function. 


Given an output function for a company, the marginal 
change in output per dollar investment in capital is the 
same as the marginal change in the output per dollar 
investment in labor. 


Y, and Z are sold at a selling price (per box) of each 
cereal given as follows: 


Brand | No. cases sold | Selling price per box 
X x 4.00 — 0.02x 
Y y 4.50 — 0.05y 
Z z 5.00 — 0.10z 


(a) What is the total revenue R if x cases of brand X, y 
cases of brand Y, and z cases of brand Z are sold? 

(b) Suppose that during the month of November, brand 
X sells for $3.88 per box, brand Y for $4.25, and 
brand Z for $4.60. If the price of each brand is in- 
creased by $0.10, what effect will this have on the 
total revenue? 


(c) What selling prices maximize the total revenue? 


. Find the maximum and minimum values of the 


function 
fæ yz) =x- v3y 


on the sphere x? + y? + z? = 4 in two ways: 
(a) by using a Lagrange multiplier; 


(b) by substituting spherical coordinates (thereby de- 
scribing the point (x, y, z) on the sphere as x = 
2sing cos, y = 2sing sin, z=2cosg) and 
then finding the ordinary (i.e., unconstrained) ex- 
trema of f(x(ọ, 8), y(o, 0), z(o, 0)). 


. Suppose that the temperature in a space is given by the 
function 


T(x, y, z) = 200xyz’. 


Find the hottest point(s) on the unit sphere in two ways: 

(a) by using Lagrange multipliers; 

(b) by letting x = sing cos0, y=singsiné, z= 
cos o and maximizing T as a function of the two 
independent variables g and @. (Note: It will help if 
you use appropriate trigonometric identities where 
possible.) 


. Consider the function f(x, y) = (y — 2x?)(y — x). 
(a) Show that f has a single critical point at the origin. 


(b) Show that this critical point is degenerate. Hence, 
it will require means other than the Hessian crite- 
rion to determine the nature of the critical point as 
a local extremum. 


(c) Show that, when restricted to any line that passes 
through the origin, f has a minimum at (0, 0). 
(That is, consider the function F(x) = f(x, mx), 
where m is a constant and the function G(y) = 
f, y)-) 

(d) However, show that, when restricted to the 
parabola y = 3x, the function f has a global 
maximum at (0, 0). Thus, the origin must be a sad- 
dle point. 


Q (e) Use a computer to graph the surface z = f(x, y). 
8. (a) Find all critical points of f(x, y) = xy that satisfy 


xX +y =l. 

(b) Draw a collection of level curves of f and, on the 
same set of axes, the constraint curve x? + y? = 1, 
and the critical points you found in part (a). 

(c) Use the plot you obtained in part (b) and a geomet- 
ric argument to determine the nature of the critical 
points found in part (a). 


9. (a) Find all critical points of f(x, y,z)= xy that 


satisfy x? + y? +22 = 1. 

(b) Give a rough sketch ofa collection of level surfaces 
of f and, on the same set of axes, the constraint 
surface x? + y? +z? = 1, and the critical points 
you found in part (a). 


(c) Use part (b) anda geometric argument to determine 
the nature of the critical points found in part (a). 


10. Find the area A of the largest rectangle so that two 


squares of total area 1 can be placed snugly inside 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
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the rectangle without overlapping, except along their 
edges. (See Figure 4.42.) 


Figure 4.42 Figure for Exercise 10. 


Find the minimum value of 


fa, X2,. 


subject to the constraint that a,x; + azx2 +--+ 
anXn = 1, assuming that a? + aż +--+ a? >0. 


2 2 2 
wey Xn) = XP t tee tx 


Find the maximum value of 


f (x1, X2,- 


subject to x + x +-+ x2 = |. Assume that not all 
of the a;’s are zero. 


oe Xn) = (1x1 + a2x2 +++ + ann), 


Find the dimensions of the largest rectangular box that 
can be inscribed in the ellipsoid x? + 2y? + 4z? = 12. 
Assume that the faces of the box are parallel to the 
coordinate planes. 


Your company must design a storage tank for Super 
Suds liquid laundry detergent. The customer’s specifi- 
cations call for a cylindrical tank with hemispherical 
ends (see Figure 4.43), and the tank is to hold 8000 gal 
of detergent. Suppose that it costs twice as much (per 
square foot of sheet metal used) to machine the hemi- 
spherical ends of the tank as it does to make the cylin- 
drical part. What radius and height do you recommend 
for the cylindrical portion so as to minimize the total 
cost of manufacturing the tank? 


Figure 4.43 The storage tank 
of Exercise 14. 


Find the minimum distance from the origin to the 
surface x? — (y — z} = 1. 


Determine the dimensions of the largest cone that can 
be inscribed in a sphere of radius a. 


Find the dimensions of the largest rectangular box 
(whose faces are parallel to the coordinate planes) that 
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18. 


19. 


20. 
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can be inscribed in the tetrahedron having three faces in 
the coordinate planes and fourth face in the plane with 
equation bcx + acy + abz = abc, where a, b, and c 
are positive constants. (See Figure 4.44.) 


Figure 4.44 Figure for Exercise 17. 


You seek to mail a poster to your friend as a gift. You 

roll up the poster and put it in a cylindrical tube of di- 

ameter x and length y. The postal regulations demand 

that the sum of the length of the tube plus its girth (.e., 

the circumference of the tube) be at most 108 in. 

(a) Use the method of Lagrange multipliers to find the 
dimensions of the largest-volume tube that you can 
mail. 


(b) Use techniques from single-variable calculus to 
solve this problem in another way. 


Find the distance between the line y = 2x + 2 and the 
parabola x = y? by minimizing the distance between 
a point (x1, yı) on the line and a point (x2, y2) on the 
parabola. Draw a sketch indicating that you have found 
the minimum value. 


A ray of light travels at a constant speed in a uniform 
medium, but in different media (such as air and water) 
light travels at different speeds. For example, if a ray of 
light passes from air to water, it is bent (or refracted) 
as shown in Figure 4.45. Suppose the speed of light 


A 


8, 


Figure 4.45 Snell’s law of refraction. 


in medium 1 is vı and in medium 2 is v2. Then, by 

Fermat’s principle of least time, the light will strike the 

boundary between medium 1 and medium 2 at a point 

P so that the total time the light travels is minimized. 

(a) Determine the total time the light travels in going 
from point A to point B via point P as shown in 
Figure 4.45. 


(b) Use the method of Lagrange multipliers to estab- 
lish Snell’s law of refraction: that the total travel 
time is minimized when 


sin 1 Vv} 


sinô v` 
(Hint: The horizontal and vertical separations of 
A and B are constant.) 


21. Use Lagrange multipliers to establish the formula 


p — exo t byo = dl 
/ a2 + b2 
for the distance D from the point (xo, yo) to the line 


ax + by =d. 
22. Use Lagrange multipliers to establish the formula 
|axo + byo + czo — d| 


for the distance D from the point (xo, yo, Zo) to the 
plane ax + by + cz = d. 


D= 


23. (a) Show that the maximum value of f(x, y,z)= 
x?y?z? subject to the constraint that x? + y? + 


z? =a? is 
aê a? 
7 (5) l 


(b) Use part (a) to show that, for all x, y, and z, 
xX Hya 
a 
(c) Show that, for any positive numbers x1, X2, ..., Xn, 
Xp + XQ +++ +X, 

n ` 


y?) < 


(1x2 e Xn) < 


The quantity on the right of the inequality is the 
arithmetic mean of the numbers x1, X2, ..., Xn, 
and the quantity on the left is called the geomet- 
ric mean. The inequality itself is, appropriately, 
called the arithmetic-geometric inequality. 
(d) Under what conditions will equality hold in the 
arithmetic-geometric inequality? 
In Exercises 24-27 you will explore how some ideas from ma- 
trix algebra and the technique of Lagrange multipliers come 
together to treat the problem of finding the points on the unit 
hypersphere 


gies Xn) =x HHH ta2=1 


that give extreme values of the quadratic form 


n 
fises Xn) = X aijXiXj, 


i j=l 


where the aj; s are constants. 


24. (a) Use a Lagrange multiplier À to set up a system of 


n + l equations in n + 1 unknowns x1, ... , Xn, À 
whose solutions provide the appropriate con- 
strained critical points. 


(b) Recall that formula (2) in §4.2 shows that the 
quadratic form f may be written in terms of ma- 
trices as 


fX,- Xn) = X" Ax, (1) 


where the vector x is written as the n x 1 matrix 
Xx] 
and A isthen x n matrix whose i jth entry 
Xn 


is a;j. Moreover, as noted in the discussion in §4.2, 
the matrix A may be taken to be symmetric (i.e., so 
that AT = A), and we will therefore assume that 
A is symmetric. Show that the gradient equation 
V f =AVg is equivalent to the matrix equation 


Ax = Ax. (2) 


Since the point (x1, ..., X,) satisfies the constraint 
x? +- + x2 = 1, the vector x is nonzero. If you 
have studied some linear algebra, you will recog- 
nize that you have shown that a constrained criti- 
cal point (x1, ..., Xn) for this problem corresponds 
precisely to an eigenvector of the matrix A asso- 
ciated with the eigenvalue å. 
x] 
(c) Now suppose that x = is one of the eigen- 
Xn 
vectors of the symmetric matrix A, with associated 
eigenvalue à. Use equations (1) and (2) to show, if 
x is a unit vector, that 


Fises 


Hence, the (absolute) minimum value that f 
attains on the unit hypersphere must be the small- 
est eigenvalue of A and the (absolute) maximum 
value must be the largest eigenvalue. 


-o Xn) =A. 


25. Letn = 2 in the situation of Exercise 24, so that we are 


considering the problem of finding points on the circle 
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26. 


27. 
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x? + y? = 1 that give extreme values of the function 


fay= ax? + 2bxy + cy’ 


senaj G] 


b 


ing the constrained critical points of the optimiza- 
tion problem described above. 


(a) Find the eigenvalues of A = | i | by identify- 


(b) Now use some algebra to show that the eigenval- 
ues you found in part (a) must be real. It is a fact 
(that you need not demonstrate here) thatany n x n 
symmetric matrix always has real eigenvalues. 


In Exercise 25 you noted that the eigenvalues Aj, 2 
that you obtained are both real. 


(a) Under what conditions does 4; = 2? 


(b) Suppose that A; and A» are both positive. Explain 
why f must be positive on all points of the unit 
circle. 


(c) Suppose that A; and A, are both negative. Explain 
why f must be negative on all points of the unit 
circle. 


Let f be a general quadratic form in n variables de- 
termined by an n x n symmetric matrix A, that is, 
fan taa Xa) = em đijXiXj = x’ Ax. 

(a) Show, for any real number k, that f(kx1,..., 
kxn) =k? f(x1,...,%,). (This means that a 
quadratic form is a homogeneous polynomial of 
degree 2—see Exercises 37—44 of the Miscella- 
neous Exercises for Chapter 2 for more about ho- 
mogeneous functions.) 


(b) Use part (a) to show that if f has a positive 
minimum on the unit hypersphere, then f must 
be positive for all nonzero x € R” and that if f 
has a negative maximum on the unit hypersphere, 
then f must be negative for all nonzero x € R”. 
(Hint: For x Æ 0, let u = x/||x||, so that x = ku, 
where k = ||x||.) 


(c) Recall from §4.2 that a quadratic form f is said 
to be positive definite if f(x) > 0 for all nonzero 
x € R” and negative definite if f(x) < 0 for all 
nonzero x € R”. Use part (b) and Exercise 24 to 
show that the quadratic form f is positive definite 
if and only if all eigenvalues of A are positive, and 
negative definite if and only if all eigenvalues of 
A are negative. (Note: As remarked in part (b) of 
Exercise 25, all the eigenvalues of A will be real.) 
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y 


H t+— xX 
a b 
Figure 5.1 The graph of 
y= f(x). 
y 
H t— x 
a b 


Figure 5.2 The shaded region has 
area J? f(x)dx. 


Multiple Integration 


5.1 Introduction: Areas and Volumes 


Our purpose in this chapter is to find ways to generalize the notion of the definite 
integral of a function of a single variable to the cases of functions of two or 
three variables. We also explore how these multiple integrals may be used to 
meaningfully represent various physical quantities. 

Let f be a continuous function of one variable defined on the closed interval 
[a, b] and suppose that f has only nonnegative values. Then the graph of f looks 
like Figure 5.1. That f is continuous is reflected in the fact that the graph consists 
ofan unbroken curve. That f is nonnegative-valued means that this curve does not 
dip below the x-axis. We know from one-variable calculus that the definite integral 
J i f(x)dx exists and gives the area under the curve, as shown in Figure 5.2. 

Now suppose that f is a continuous, nonnegative-valued function of two 
variables defined on the closed rectangle 


R={(x,y)€R’la<x<b,c<y<d} 


in R?. Then the graph of f over R looks like an unbroken surface that never dips 
below the xy-plane, as shown in Figure 5.3. In analogy with the single-variable 
case, there should be some sort of integral that represents the volume under the 
part of the graph that lies over R. (See Figure 5.4.) We can find such an integral 
by using Cavalieri’s principle, which is nothing more than a fancy term for the 
method of slicing. Suppose we slice by the vertical plane x = x9, where x is a 
constant between a and b. Let A(xo) denote the cross-sectional area of such a 
slice. Then, roughly, one can think of the quantity A(xo) dx as giving the volume 
of an “infinitely thin” slab of thickness dx and cross-sectional area A(x). (See 
Figure 5.5.) Hence, the definite integral 


b 
V= i A(x) dx 
gives a “sum” of the volumes of such slabs and can be considered to provide a 
reasonable definition of the total volume of the solid. 

But what about the value of A(x9)? Note that A(x9) is nothing more than the 
area under the curve z = f (xo, y), obtained by slicing the surface z = f(x, y) 
with the plane x = xq. Therefore, 


d 
ad= / ieni 


Figure 5.3 
z= f(x, y). 


The graph of 


Plane z=c 


(0, b, 0) 
y 


(a,b, 0) 


Figure 5.6 Calculating the 
volume of the box of Example 1. 


Figure 5.7 


z=4-x? 


Example 2. 


The graph of 
—y of 


5.1 | Introduction: Areas and Volumes 311 


Figure 5.4 The region under the Figure 5.5 A slab of “volume” 
portion of the graph of f lying dV = A(xo)dx. 

over R has volume that is given 

by an integral. 


(remember xo is a constant), and so we find that 


v= f awas f [f se.nayae. (1) 


The right-hand side of formula (1) is called an iterated integral. To calculate 
it, first find an “antiderivative” of f(x, y) with respect to y (by treating x as a 
constant), evaluate at the integration limits y = c and y = d, and then repeat the 
process with respect to x. 


EXAMPLE 1 Let’s make sure that the iterated integral defined in formula (1) 
gives the correct answer in a case we know well, namely, the case of a box. We’ll 
picture the box as in Figure 5.6. That is, the box is bounded on top and bottom by 
the planes z = c (where c > 0) and z = 0, on left and right by the planes y = 0 
and y = b (where b > 0), and on back and front by the planes x = 0 and x = a 
(a > 0). Hence, the volume of the box may be found by computing the volume 
under the graph of z = c over the rectangle 


R={(x,y)|0<x<a,0<y<D}. 


Using formula (1), we obtain 


a b a a 
V= / f cdydx = f (aa) dx = I cbdx = bx) =) = cba. 
o Jo 0 0 


This result checks with what we already know the volume to be, as it should. ® 


EXAMPLE 2 We calculate the volume under the graph of z = 4 — x? — y? 
(Figure 5.7) over the square 


R={(x,y)|-l<x<1, -l<y<l}. 


Using formula (1) once again, we calculate the volume by first integrating with 
respect to y (i.e., by treating x as a constant in the inside integral) and then by 
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y = yo plane 


dy 


Figure 5.8 Slicing by y = yo 
first. 


integrating with respect to x. The details are as follows: 


1 1 1 1 y 
= 4— Zo 2) dyd — (4 y2 -3>) 
V Li x” — y)dy dx I ee A 

1 
=f ((4 x? ;) ( 4x? 4 ;) ) ax 
1 3 3 


2 2AT 22 2 22 2\ 40 
= |- = = = coe 
3 3 Fig N3 3 3 3 3 


In our development of formula (1), we could just as well have begun by slicing 
the solid with the plane y = yo (instead of with the plane x = x9), as shown in 
Figure 5.8. Then, in place of formula (1), the formula that results is 


d b 
v= i Fx, y)dx dy. (2) 


Since the iterated integrals in formulas (1) and (2) both represent the volume of the 
same geometric object, we can summarize the preceding discussion as follows. 


PROPOSITION 1.1 Let R be the rectangle {(x, y)|a <x <b, c < y < d}and 
let f be continuous and nonnegative on R. Then the volume V under the graph 


of f over R is 
b pd d pb 
| | tenayax= f | f(x, y)dx dy. 


EXAMPLE 3 We find the volume under the graph of z = cos x sin y over the 
rectangle 


R= |, y)l0<x<5,0<ys7h. 


(See Figure 5.9.) From formula (1), we calculate that the volume is 


n/2 px/4 m/2 cià 
V =f / cos x sin y dy dx >l (— cos x cos ga dx 
0 0 0 i 


x/2 2 29/2. [7 
=] (Beos - ceos) dx = a cos x dx 
0 2 2 0 


m/2 
2— y2 P 2-2 2— 4/2 
5 sin x = (1 = . 
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Figure 5.9 The surface z = cos x sin y of 
Example 3. 


If we use formula (2) instead of formula (1), we obtain 


m/4 7/4 2 
v=f F ‘eosxsinyaxdy = f" (sinx sin y) W dy 


-{" (sin y — 0)dy = — cos y|3 ^ 
0 


v2 2-2 
= (-l)= 
2 2 
That this result agrees with our first calculation is no surprise given 


Proposition 1.1. + 


5.1 Exercises 


Evaluate the iterated integrals given in Exercises 1—6. 7. Find the volume of the region that lies under the graph 
2 23 of the paraboloid z = x? + y? +2 and over the rect- 
1: i f (x? + y)dydx angle R = {(x,y)|—1 <x < 2,0 < y <2} in two 

0 Ji ways: 


N 


m po (a) by using Cavalieri’s principle to write the volume 
/ f y sinx dy dx as an iterated integral that results from slicing the 
region by parallel planes of the form x = constant; 


w 


J j f i xe dy dx (b) by using Cavalieri’s principle to write the volume 
as an iterated integral that results from slicing the 
region by parallel planes of the form y = constant. 


m/2 
i, [ e* cos y dx dy 8. Find the volume of the region bounded on top by the 
plane z = x + 3y + 1, on the bottom by the xy-plane, 
es and on the sides by the planes x = 0, x = 3, y= 1, 
Jf fe +x? + Iny)dx dy y=2. 

9. Find the volume of the region bounded by the graph 
T [= In Vx dx dy of f(x, y) = 2x? + yt sin zx, the xy-plane, and the 

planes x = 0,x = 1, y l, y=2. 


= 


vi 


oa 


314 


Chapter 5 | Multiple Integration 


In Exercises 10-15, calculate the given iterated integrals and a > 
au, og ae ee j 13. eee 
indicate of what regions in R? they may be considered to 

represent the volumes. 


2 p3 
10. | | 2dx dy 
o Ji 


3 2 
11. / J (16 — x? — y”)dy dx 
1 —2 


j 


m/2 


T 
/ sin x cos y dx dy 


z/2 J0 


i) eee 
15. f fc-wnera 


16. Suppose that f is a nonnegative-valued, continu- 
ous function defined on R = {(x, y)|a<x<b,c< 
y < d}. If f(x, y) < M for some positive number M, 
explain why the volume V under the graph of f over 
R is at most M(b — a)(d — c). 


5.2 Double Integrals 


In the previous section we saw how to calculate volumes of certain solids us- 
ing iterated integrals. The ideas were mostly straightforward, but the situation 
we addressed was rather special: We only solved the problem of computing the 
volume of a solid defined as the region lying under the graph of a continuous, 
nonnegative-valued function f(x, y) and above a rectangle in the xy-plane. It is 
not immediately apparent how we might compute the volume of a more general 
solid based on this work. 

Thus, in this section we define a more general notion of an integral of a 
function of two variables that will allow us to describe 


1. integrals of arbitrary functions (i.e., functions that are not necessarily non- 
negative or continuous) and 


2. integrals over arbitrary regions in the plane (i.e., rather than integrals over 
rectangles only). 


We focus first on case 1. To do this, we start fresh with some careful definitions 
and notation. The ideas involved in Definitions 2.1—2.3 below are different from 
those in the previous section. However, we will see that there is a key connection 
(called Fubini’s theorem) between the notion of an iterated integral discussed 
in §5.1 and that of a double integral, which will be described in Definition 2.3. 


The Integral over a Rectangle 
We also denote a (closed) rectangle 


R={(x,y)€R*|a<x<b,c<y<d} 


by [a, b] x [c, d]. This notation is intended to be analogous to the notation for a 
closed interval. 


DEFINITION 2.1 Given a closed rectangle R = [a, b] x [c, d], a partition 
of R of order n consists of two collections of partition points that break up 
R into a union ofn? subrectangles. More specifically, fori, j = 0,...,n, we 
introduce the collections {x;} and {y;}, so that 


@ = ity Ky Koco SH] KG Koo SH, = 
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and 
C= yo <M <5 < Yj < Yi < c++ <n =a. 


Lei Ave =o =H (OF f= osoni Gal A =e = a Gor 7 = 
1,...,n). Note that Ax; and Ay; are just the width and height (respec- 
tively) of the i jth subrectangle (reading left to right and bottom to top) of the 
partition. 


An example of a partitioned rectangle is shown in Figure 5.10. We do not 
assume that the partition is regular (i.e., that all the subrectangles have the same 
dimensions). 


d=y, 


ł x 
A=Xy X1 Xp Xj_y Xj + X,=D 


Figure 5.10 A partition of the rectangle [a, b] x [c, d]. 


DEFINITION 2.2 Suppose that f is any function defined on R = [a, b] x 
[c, d] and partition R in some way. Let ¢;; be any point in the subrectangle 


fig = (Bi Ball x [gato Wal =F N: 
Then the quantity 
n 
J= D I CETAAT 
ij=1 
where AA;; = Ax; Ay; is the area of R;j, is called a Riemann sum of f on 
R corresponding to the partition. 


The Riemann sum 


S= 5 f(¢ij) AA;j 
Lj 


depends on the function f, the choice of partition, and the choice of the “test 
point” ¢;; in each subrectangle R;; of the partition. The Riemann sum itself is 
just a weighted sum of areas AAj; of subrectangles of the original rectangle R, 
the weighting being given by the value f(c;;). 

If f happens to be nonnegative on R, then, fori, j = 1, ..., n, the individual 
terms f(c;;) A Aj; in S may be considered to be volumes of boxes having base area 
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Figure 5.11 The volume under the graph 
of f is approximated by the Riemann sum. 


A, A; 


A, 


Figure 5.13 If Ay, Ad, A3 
represent the values of the shaded 
areas, then 


Jl Fœ) dx = Ay — Ap + As. 


R 
(in xy-plane) 


Figure 5.14 If V,, V2 represent 
the volumes of the shaded regions, 
then ffr f(x, y)dA = Vi — V2. 
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z=f(x,y) 


Xy-plane 


f<0 here. 

Volume of this 
box enters § 
with a — sign. 


f>0 here. 
Volume of this 
box enters S 
with a + sign. 


Figure 5.12 The Riemann sum as a signed sum of 
volumes of boxes. 


Ax; Ay; and height f (c;;). Therefore, S can be considered to be an approximation 
to the volume under the graph of f over R, as suggested by Figure 5.11. If f is 
not necessarily nonnegative, then the Riemann sum S is a signed sum of such 
volumes (because, with f(c;;) < 0, the term f(c;;)AAj;; is the negative of the 
volume of the appropriate box—see Figure 5.12). 


DEFINITION 2.3 The double integral of f on R, denoted by [fẹ f dA 
(or by ffe f(x, y)dA or by ff, f(x, y)dx dy), is the limit of the Riemann 
sum S as the dimensions Ax; and Ay; of the subrectangles Rj; all approach 
zero, that is, 


[ [14 2 E) Xi Yj> 


provided, of course, that this limit exists. When ffp f dA exists, we say that 
f is integrable on R. 


The crucial idea to remember—indeed, the defining idea—is that the integral 
Jfk f aA is a limit of Riemann sums S, for this concept is what is needed to 
properly apply double integrals to physical situations. 

From a geometric point of view, just as the single-variable definite integral 
J 4 f(x)dx can be used to compute the “net area” under the graph of the curve 
y = f(x) (asin Figure 5.13), the double integral f fẹ f dA can be used to compute 
the “net volume” under the graph of z = f(x, y) (as in Figure 5.14). 

Another way to view the double integral ff r f dA is somewhat less geometric 
but is more in keeping with the notion of the integral as the limit of Riemann 
sums and provides a perspective that generalizes to triple integrals of functions of 
three variables. Instead of visualizing the graph of z = f(x, y) as a surface and 
S= Dr jai f (&j)Ax; Ay; as a (signed) sum of volumes of boxes related to the 
graph, consider S to be a weighted sum of areas and the integral [fẹ f dA the 
limiting value of such weighted sums as the dimensions of all the subrectangles 
approach zero. With this point of view, we do not depict the integrand f when we 
try to visualize the integral. In this way, the distinction between the roles of the 
integrand and the rectangle R over which we integrate can be made clearer. (See 
Figure 5.15.) 
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This subrectangle of 
area AAs; contributes 
f(€s3) AAs; to S. 


Figure 5.15 S = J`; ; f(cij)AAi;. 


EXAMPLE 1 Suppose that a3 cm square metal plate is made, but some nonuni- 
formities exist due to the manufacturing process so that the mass density varies 
somewhat throughout the plate. If we knew the density function ô(x, y) at every 
point in the plate, then we could calculate the total mass of the plate as 


Total mass = |f d(x, y)dA, 
D 


where D denotes the square region of the plate placed in an appropriate coordinate 
system. 

In the absence of an analytic expression for 5, we nonetheless can approximate 
the double integral by means of a Riemann sum: We partition the square region 
of the plate, take density readings at a test point in each subregion, and combine 
to approximate the integral for the total mass. (Essentially what we are doing is 
assuming that the density is nearly constant on each subregion so that multiplying 
density and area will give the approximate mass of the subregion; adding these 
approximate masses then gives an approximation for the total mass.) For example, 
we might model the problem as in Figure 5.16, where the region of the plate is 


y 


T oI 
| 1 | 
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E (0.6) i i e 
l | 
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Figure 5.16 The region of Example 1. The 3 x 3 
square is partitioned into nine subregions. The density 
values at test points in each subregion are shown. 
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zZ 


Figure 5.17 The graph 
of z = x of Example 2. 


Ci 


Figure 5.18 The two 
subrectangles R; ; and R;, j are 
symmetrically placed with respect 
to the y-axis. The corresponding 
test points ¢;,; and ¢;,; are chosen 
so that they have the same 
y-coordinates and opposite 
x-coordinates. 


Figure 5.19 The graph of a 
piecewise continuous function. 


partitioned into nine square subregions. Then we have 


Total mass = Tf 5(x, y)dA ~ Y` 8leiyj)AA;j 
D i,j 


= (0.2)1 + (0.3)1 + (0.6)1 + (0.1)1 + (0.2)1 + (1)1 + (0.3)1 
+ (0.3)1 + (0.5)1 = 3.5. m 


EXAMPLE 2 We determine the value of ffẹxdA, where R = [-2, 2] x 
[—1, 3]. Here the integrand f(x, y) = x and, if we graph z = f(x, y) over R, 
we see that we have a portion of a plane, as shown in Figure 5.17. Note that the 
portion of the plane is positioned so that exactly half of it lies above the xy-plane 
and half below. Thus, if we regard ffp x dA as the net volume under the graph of 
z = x, then we conclude that ffp x dA (if it exists) must be zero. 

On the other hand, we need not resort to visualization in three dimensions. 
Consider a Riemann sum corresponding to ffẹxdA obtained by partitioning 
R = [—2, 2] x [—1, 3] symmetrically with respect to the y-axis and by choosing 
the “test points” ¢;; symmetrically also. (See Figure 5.18.) It follows that the value 
of 


S= > Fe AA = X xij AAij 


(where x;; denotes the x-coordinate of ¢;;) must be zero since the terms of the 
sum cancel in pairs. Furthermore, we can arrange things so that, as we shrink 
the dimensions of the subrectangles to zero (as we must do to get at the integral 
itself), we preserve all the symmetry just described. Hence, the limit under these 
restrictions will be zero, and thus, the overall limit (where we do not impose such 
symmetry restrictions on the Riemann sum), if it exists at all, must be zero as 
well. + 


Example 2 points out fundamental difficulties with Definition 2.3, namely, 
that we never did determine whether ffp f dA really exists. To do this, we would 
have to be able to calculate the limit of Riemann sums of f over all possible 
partitions of R by using all possible choices for the test points ¢;;, a practically 
impossible task. Fortunately, the following result (which we will not prove) pro- 
vides an easy criterion for integrability: 


THEOREM 2.4 If f is continuous on the closed rectangle R, then ffp f dA 
exists. 


In Example 2, f(x, y) =x is a continuous function and hence integrable 
by Theorem 2.4. The symmetry arguments used in the example then show that 
[f,xdA=0. 

Continuous functions are not the only examples of integrable functions. In 
the case of a function of a single variable, piecewise continuous functions are also 
integrable. (Recall that a function f(x) is piecewise continuous on the closed 
interval [a, b] if f is bounded on [a, b] and has at most finitely many points of 
discontinuity on the interior of [a, b]. Its graph, therefore, consists of finitely many 
continuous “chunks” as shown in Figure 5.19.) For a function of two variables, 
there is the following result, which generalizes Theorem 2.4. 
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THEOREM 2.5 If f is bounded on R and if the set of discontinuities of f on R 
has zero area, then ffp f dA exists. 


To say that a set X has zero area as we do in Theorem 2.5, we mean that we 
can cover X with rectangles R1, R2, ..., Rn, ... (1.e., so that X C Uzi R,,), the 
sum of whose areas can be made arbitrarily small. 

A function f satisfying the hypotheses of Theorem 2.5 has a graph that 
looks roughly like the one in Figure 5.20. Theorem 2.5 is the most general suffi- 
cient condition for integrability that we will consider. It is of particular use to us 
when we define the double integral of a function over an arbitrary region in the 


plane. 
z=f@y) A 4 


y 
> Co 
= R Discontinuities of f 
I| O 
K R 
x 


Discontinuities of f 


Figure 5.20 The graph of an integrable 
function. 


Although Theorems 2.4 and 2.5 make it relatively straightforward to check 
that a given integral exists, they do little to help provide the numerical value of 
the integral. To mechanize the evaluation of double integrals, we will use the 
following result: 


THEOREM 2.6 (FUBINI’S THEOREM) Let f be bounded on R = [a, b] x [c,d] 
and assume that the set S of discontinuities of f on R has zero area. If every line 
parallel to the coordinate axes meets S in at most finitely many points, then 


ff raaf [orenayar= f fox. yjdxdy. 


Fubini’s theorem demonstrates that under certain assumptions the double in- 
tegral over a rectangle (i.e., the limit of Riemann sums) can be calculated by using 
iterated integrals and, moreover, that the order of integration for the iterated inte- 
gral does not matter. We remark that the independence of the order of integration 
depends strongly on the fact that the region of integration is rectangular; it will not 
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generalize to more arbitrary regions in such a simple way. (A proof of Theorem 
2.6 is given in the addendum to this section.) 


EXAMPLE 3 Werevisit f fẹ x dA inExample2, where R = [—2, 2] x [—1, 3]. 
By Theorem 2.6, we know that ffp x dA exists and by Fubini’s theorem, we 


calculate 
y=3 
dx 
jaa 


e setae 


2 2 
=f s86- CDdx= f 4xdx = 2x? |, =8-8=0, 
2 —2 


which checks. Furthermore, we also have 


3 p2 3 
Jf xaa=] J xdxdy= | 5x? 
R -1J-2 zi 


PROPOSITION 2.7 (PROPERTIES OF THE INTEGRAL) Suppose that f and g are 
both integrable on the closed rectangle R. Then the following properties hold: 


x= 


2 
_ay= [°,2-2)dy =0. 


+ 


x= 


1. f + g is also integrable on R and 


[frtaar=foraas ff gaa. 


2. cf is also integrable on R, where c € R is any constant, and 


Jf eraa=eff faa 


3. If f(x, y) < g(x, y) for all (x, y) € R, then 


ff tæ naas ff saaa. 


4. | f| is also integrable on R and 


Jlez [fires 


Properties 1 and 2 are called the linearity properties of the double integral. 
They can be proved by considering the appropriate Riemann sums and taking 
limits. For example, to prove property 1, note that the Riemann sum whose limit 


Figure 5.21 A bounded region D 
in the plane. 


Figure 5.22 A type 1 elementary 
region. 


x= B(y) 
x= ay) 


y= 6 


Figure 5.23 A type 2 elementary 
region. 
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is [{,(f + 8)dA is 


SF + g(eiAAij = > (fle) + g(cij)) AAi; 
FE 


i j=l 


= J F&A + D> sedi 
i,j=l 


i, j=l 


R T 


Property 3 (known as monotonicity) and property 4 can also be proved using 
Riemann sums. For property 4, one needs to use the fact that 


n n 
Ya < Y laul. 
k=1 k=1 


Double Integrals over General Regions in the Plane 


Our next step is to understand how to define the integral of a function over an 
arbitrary bounded region D in the plane. Ideally, we would like to give a precise 
definition of ff p f dA, where D is the amoeba-shaped blob shown in Figure 5.21 
and where f is bounded on D. In keeping with the definition of the integral over 
a rectangle, ff f dA should be a limit of some type of Riemann sum and should 
represent the net volume under the graph of f over D. Unfortunately, the techni- 
calities involved in making such a direct approach work are prohibitive. Instead, we 
shall consider only certain special regions (rather than entirely arbitrary ones), and 
we shall assume that the integrand f is continuous over the region of integration 
(which will allow us to use what we already know about integrals over rectangles). 
Although this approach will not provide us with a completely general definition, 
it is sufficient for essentially all the practical situations we will encounter. 
To begin, we define the types of elementary regions we wish to consider. 


DEFINITION 2.8 We say that D is an elementary region in the plane if it 
can be described as a subset of R? of one of the following three types: 


Type 1 (see Figure 5.22): 

D={(x,y)|v@)< y < ô&), a <x < b}, 
where y and ô are continuous on [a, b]. 
Type 2 (see Figure 5.23): 

D={(, y)|a(y) <x < BQ), EE 
where g and £ are continuous on [c, d]. 


Type 3 D is of both type 1 and type 2. 


Thus, a type 1 elementary region D has a boundary (denoted ð D) consisting 
of straight segments (possibly single points) on the left and on the right and graphs 
of continuous functions of x on the top and on the bottom. A type 2 elementary 
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Figure 5.24 The unit disk 
D = {(x, y) | x? + y? < 1} isa 
type 3 region. 


region has a boundary that is straight on the top and bottom and consists of graphs 
of continuous functions of y on the left and right. 


EXAMPLE 4 The unit disk, shown in Figure 5.24, is an example of a type 3 
elementary region. It is a type 1 region since 
1}. 


D=|¢@, y) | -v13 sysVi-¥,-1<x 


(See Figure 5.25.) It is also a type 2 region since 


lA 


D={|@,9)|-vI-y¥ <x < VI- -1 <y< t}: 
(See Figure 5.26.) $ 
y 
y 
y=vV1 -x2 


x 
x=-1 x=1 
y=-vV1-x? y==1 
Figure 5.25 The unit disk D as a Figure 5.26 The unit disk D as a type 2 
type 1 region. region. 


Now we are ready to define ffp f dA, where D is an elementary region and 
f is continuous on D. We construct a new function f°, the extension of f, by 


ext 2 f(x, y) if(x,y)E€ D 
fey) = t eee 


Note that, in general, f®t will not be continuous, but the discontinuities of f™' 
will all be contained in dD, which has no area. Hence, by Theorem 2.5, f°" is 
integrable on any closed rectangle R that contains D. (See Figure 5.27.) 


z y 
R 


a 


x z =f™(x, y) 


Figure 5.27 The graph of z = f™*(x, y). 


y = 3x? 


IDIR (1, 3) 


(2,0) 


Figure 5.28 The domain of f of 
Example 5. 
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DEFINITION 2.9 Under the previous assumptions and notation, if R is any 
rectangle that contains D, we define 


Jf taa to be J| iraa. 


Note that Definition 2.9 implicitly assumes that the choice of the rectangle R 
that contains D does not affect the value of ffe f' dA. This is almost obvious 
but still should be proved. We shall not do so directly but instead establish the 
following key result: 


THEOREM 2.10 Let D be an elementary region in R? and f a continuous 
function on D. 


1. If D is of type 1 (as described in Definition 2.8), then 


b d(x) 
Tf fdA =] f(x, y)dy dx. 
D a Jy(x) 


2. If D is of type 2, then 


d Bly) 
If faa= f f(x, y)dx dy. 
D c Jay) 


Theorem 2.10 provides an explicit and straightforward way to evaluate double 
integrals over elementary regions using iterated integrals. Before we prove the 
theorem, let us illustrate its use. 


EXAMPLE 5 Let D be the region bounded by the parabolas y = 3x7, y = 
4 — x* and the y-axis as shown in Figure 5.28. (Note that the parabolas intersect at 
the point (1, 3).) Since D is a type 1 elementary region, we may use Theorem 2.10 
with f(x, y) = x’y to find that 


1 4—x? 
Tf yda= | 1 x’y dy dx. 
D 0 3x? 


The limits for the first (inside) integration come from the y-values of the top and 
bottom boundary curves of D. The limits for second (outside) integration are the 
constant x-values that correspond to the straight left and right sides of D. The 
evaluation itself is fairly mechanical: 


1 4—x? 1 Dead 
f [eee E 
0 3x2 0 2 


2 
y=4—x" 


y=3x2 
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Note that after the y-integration and evaluation, what remains is a single definite 
integral in x. The result of calculating this x-integral is, of course, a number. Such 
a situation where the number of variables appearing in the integral decreases with 
each integration should always be the case. + 


Proof of Theorem 2.10 For part 1, we may take D to be described as 
D = {x, y) | yŒ) < y < ê(x), a < x < b}. 
We have, by Definition 2.9, that 


ffa- [es 


where R is any rectangle containing D. Let R = [a', b'] x [c’, d'], where a’ < a, 
b' > b,andc’ < y(x), d' > 6(x) for all x in [a, b]. That is, we have the situation 
depicted in Figure 5.29. Since f**' is zero outside of the subrectangle R = 
[a, b] x [c’, d’], 


b fpd 
1 f™ dA = J 7 aA = i i f, y)dydx 
R R a E 


by Fubini’s theorem. For a fixed value of x between a and b, consider the y- 
integral f f(x, y)dy. Since f™ (x, y) = O unless y(x) < y < 4(x) (in which 
case f(x, y) = f(x, y), 


d' ô(x) 
E E 
c y (x) 
and so 
b g 
Jf senaa= ff raaz] f rte. nayax 
D R a c 
b 5(x) 
=f | renayas, 
a Jy(x) 
as desired. 
The proof of part 2 is very similar. m 


Figure 5.29 The region R is the union of R4, Ro, 
andR3. 


Figure 5.30 The region D of 
Example 6. 


Figure 5.31 The region D of 
Example 6 as a type 1 region. 


Figure 5.32 The region D of 
Example 6 as a type 2 region. 
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We continue analyzing examples of double integral calculations. 


EXAMPLE 6 Let D be the region shown in Figure 5.30 having a triangular 
border. Consider ff (1 — x — y)dA. Note that D is a type 3 elementary region, 
so there should be two ways to evaluate the double integral. 

Considering D as a type 1 elementary region (see Figure 5.31), we may apply 
part 1 of Theorem 2.10 so that 


[fa-s-yaas f [u-x-yavan 
1 


7 y2\ pal 
= Í (> —xy- z) 2 dx 
1 aai 
=f (« x)—x(1—x) a = as 
0 


1 2 
(x) 
=| z Oe = a(1—x)*|, i. 


We can also consider D as a type 2 elementary region, as shown in Figure 5.32. 
Then, using part 2 of Theorem 2.10, we obtain 


[fa-s-na= f [7 a-r- yaray: 


We leave it to you to check explicitly that this iterated integral also has a value of 


. Instead, we note that 
1 l—x 
i | (l — x — y)dy dx 
0 Jo 


1 l-y 
/ Í (l — x — y)dx dy 
o Jo 


by exchanging the roles of x and y. Hence, the two integrals must have the same 
value. In any case, the double integral 


[[a-s-»aa 


represents the volume under the graph of z = 1 — x — y over the triangular region 
D. Ifwe picture the situation in R?, as in Figure 5.33, we see that the double integral 
represents the volume of a tetrahedron. + 


can be transformed into 


Of course, not all regions in the plane are elementary, including even some 
relatively simple ones. To integrate continuous functions over such regions, the 
best advice is to attempt to subdivide the region into finitely many of elementary 
type. 
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x~ (1,0, 0) 


+y=4 


Figure 5.33 The double integral 


of Example 6 represents the 
volume of the tetrahedron. 


Figure 5.34 The region D of Figure 5.35 The region D of Example 7 
Example 7. subdivided into four elementary regions. 


EXAMPLE 7 Let D be the annular region between the two concentric circles 
of radii 1 and 2 shown in Figure 5.34. Then D is not an elementary region, but we 
can break D up into four subregions that are of elementary type. (See Figure 5.35.) 
If f(x, y) is any function of two variables that is continuous (hence integrable) 
on D, then we may compute the double integral as the sum of the integrals over 
the subregions. That is, 


J|tia=ff sans ff rans ff cars ff paa 


For the type 1 subregions, we have the set-up shown in Figure 5.36: 


fdA= = ee ads 
ae 


and 


Jf ra= f f sedas 


For the type 2 subregions, we use the set-up shown in Figure 5.37: 


fdA= ! eon 
Də -1d./1-y? 


Figure 5.36 The subregions Dı and Figure 5.37 The subregions Dz and D4 of 
D; of Example 7 are of type 1. Example 7 are of type 2. 


-1 


Figure 5.38 The region D 
of Example 8. 
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J [saan f’ [OT teasa 


The difficulty of evaluating each of the preceding four iterated integrals then 
depends on the complexity of the integrand. + 


and 


EXAMPLE 8 We calculate ffp ydA, where D is the region bounded by the 
line x — y = 0 and the parabola x = y* — 2. (See Figure 5.38.) 

In this case D is a type 2 elementary region, where the left and right boundary 
curves may be expressed as x = y* — 2 and x = y, respectively. These curves 
intersect where 


y—-2=y => y?-y-2=0 4 y=-1,2. 


Therefore, part 2 of Theorem 2.10 applies to give 


2 y 
Tf yda= | / y dx dy 
D —lJy?—2 


2 y 2 
= xyi, dy= | (y?—(* —2)y) dy 
={ y2=2: 7 =! 


Now, although D is not a type 1 elementary region, it may be divided into 
two type 1 subregions along the vertical line x = —1. (See Figure 5.39.) The 
subregion D; lying left of the line x = —1 is bounded on both top and bottom by 
the parabola x = y? — 2; by solving for y we may express the bottom boundary 
of Dı as y = —J/x + 2 and the top boundary as y = yx + 2. The subregion 
Dy lying right of x = —1 is bounded on the bottom by y = x and on the top by 


Figure 5.39 The region D of Example 8 is divided 
into subregions D; and D> by the line x = —1. 
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y = /x + 2. Putting all this information together, we have 


[fvaa= ff vaas ff vaa 


-1 pv¥x42 2 pvx+2 
-| i ydydx + | J ydydx 
—2 —J/x+2 -l Jx 


—1 2.2 ]y=vVx+F2 2 2 )y=vxt2 
y y 
= a dx +f — dx 
29 2 y=—/x+2 =l 2 y=x 


=l 2 /x+2 x? 
= Odx 4 d 
l, i J 2 3) ” 


Addendum: Proof of Theorem 2.6 


Step 1. First we establish Theorem 2.6 in the case where f is continuous on 
R = [a, b] x [c, d]. By Theorem 2.4, we know that f fẹ f dA exists. Let F be 
the single-variable function defined by 


d 
F(x) = / FEI: 


(Note: Since f is continuous on R, the partial function in y obtained by holding 


x constant is continuous on [c, d]. Hence, f “ F(x, y)dy exists for every x in 
[a, b].) We show that 


[ ros- [f rea] dx= |f faa. 


Let a = x9 < xı < -> < Xn = b be any partition of [a, b]. Then a general 
Riemann sum that approximates J P F(x)dx is 


5 F(x; Ax, (1) 
i=l 


where Ax; = x; — xj-; and xë € [x;-1, xj]. Nowletc = yo < yi < < y} =d 
be a partition of [c, d]. (The partitions of [a, b] and [c, d] together give a partition 
of R = [a, b] x [c, d].) Therefore, we may write 


d 
F(x) = f fle, y)dy 
d 


= | to.yay+ [fen sdy ee fx, ay 


y Yn-1 


n Yj 
= >| Teds 
j=l YY- 
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By the mean value theorem for integrals,’ on each subinterval [y;—1, yj] there 
exists a number y} such that 


y 


f(x, ydy = Oj -y DSE, YD) = SO, YAY; 


Yj-1 


The choice of y; in general depends on x, so henceforth, we will write y;(x) for 
y; Consequently, 


F(x) = D> f(x, yi) Ay, 


j=l 


and the Riemann sum (1) may be written as 


2o F(x; Ax; = >| fj, Vj OP NAY; (Ax = > f(c) Axi Ay;, 
izl 1 ij=l 


i=1 (j= 


where c; = (x7, y;(4;)). Note that ej; € [xi-1, xi] x [yj-1, yj]. (See Fig- 
ure 5.40.) 


Yi ° cy? 


t ° x 
5-1 XP Xx; b 


Figure 5.40 The point ¢;; = (x7, ył(xř)) used in 
the proof of Theorem 2.6. 


We have thus shown that given any partition of [a, b], we can associate a 
suitable partition of R = [a, b] x [c, d] such that the Riemann sum (1) that ap- 
proximates J. _ F(x)dx is equal to a Riemann sum (namely, )°; j f(c) Ax; Ay;) 
that approximates ffp f dA. Since f is continuous, we know that 


SF (ciAxi Ay; approaches Jf raa 
i,j R 


as Ax; and Ay; tend to zero. Hence, 


f reac= ff raa: 


By exchanging the roles of x and y in the foregoing argument, we can show 


that a 
ff raf f f(x, y)dx dy. 


' The mean value theorem for integrals says that, if g is continuous on [a, b], then there is some number 
c with a < c < b such that J? g(x)dx = (b —a)g(c). 


330 


Chapter 5 | Multiple Integration 


Step 2. Now we prove the general case of Theorem 2.6 (i.e., the case that 
f has discontinuities in R = [a, b] x [c, d]). By hypothesis, the set S of discon- 
tinuities of f in R are such that every vertical line meets S in at most finitely 
many points. Thus, for each x in [a, b], the partial function in y of f(x, y) is 
continuous throughout [c, d], except possibly at finitely many points. (In other 
words, the partial function is piecewise continuous.) Then, because f is bounded, 


d 
F(x) = J Tada 


exists. 
Now we proceed as in Step 1. That is, we begin with a partition of [a, b] into 
n subintervals and a corresponding Riemann sum 


D F(x¥)Axi. 
i=l 


Next, we partition [c, d] into n subintervals. Hence, 


n y 


d j 
ran= f fet »dy= Sf fet ay. D 


ga. yji 


As in Step 1, the partitions of [a, b] and [c, d] combine to give a partition of R. 
Write R as Rı U R2, where R; is the union of all subrectangles 


Rij = [xi-1, Xi] x [yj-1, y;] 
that intersect S and Rz is the union of the remaining subrectangles. Then we may 
apply the mean value theorem for integrals to those intervals [y;—1, yj] on which 
f(x}, y) is continuous in y, thus replacing the integral 


[ fot. nay 
by 
SOF, VIP Ay; = F(ci)Ay;- 
Since f is bounded, we know that 


If(x, yl < M 
for some M and all (x, y) € R. Therefore, on the intervals [y;_1, yj] where 
f(x}, y) fails to be continuous, we have 


ft, y)dy 


yj-1 


< a |F, y)| dy 


si- 


yj 
=j M dy = M(y; — yj-1) = M Ayj. (3) 
yj-1 


From equation (2), we know that 


yD F(x; )Ax; 
i=l 


DI Foss, yy] 


i,j=l 


> if Fos, yy Ay 


RijCRiUR2 D 
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SI fost. ay} an 
RijCRi 


es te Foss. day] As 


RijCR2 


Therefore, 


Ax; — | Foss, yd dy] 
x 2 x*, y)dy Į Ax 


Sf Foss, yay} Asi) 
RjcRi 


Applying the mean value theorem for integrals to the left side of equation (4) and 
inequality (3) to the right side, we obtain 


(4) 


jp AX — om f(cijJAxi Ay;| < 5 M Axi Ay; 
RijCR2 RijCRI 
= M -areaof R}. 


Now S has zero area (by hypothesis) and is contained in R. By letting the 
partition of R become sufficiently fine (i.e., by making Ax;, Ay; small), the term 
M - area of R; can be made arbitrarily small. (See Figure 5.41.) 


y 


d+ 


a b 


Figure 5.41 The set Rı (shaded area) consists of 
the subrectangles of the partition of R that meet S, 
the set of discontinuities of f on R. As the partition 
becomes finer, the area of R, tends toward zero. 


Therefore, as all Ax; and Ay; tend to zero, we have that the sums 
So F(f)Ax; and >> f(ei)AxAy;, 
i RijCR2 
and the term M - area of R; converge (respectively) to 


b 
[ Fea, J fdA, and 0. 
R 


a 
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We conclude that 


f Fœ- ff raa=o. 
ff raaf f ra navar 


Again, by exchanging the roles of x and y, we can show that 


ff raaf f todas 


that is, 


as well. E 
5.2 Exercises 
1. Use Definition 2.3 and Theorem 2.4 to determine a ioe e 
the value of f f(y? + sin2y) dA, where R = [0, 3] x 8. [ Í., (x + y“) dy dx 
—1, 1]. 
[ ] 2. 
2. Let R = [—3,3] x [-2, 2]. Without explicitly eval- 9. f | x sin (y?) dx dy 
uating any iterated integrals, determine the value of 0 0 
(x5 + 2y) dA. m psinx 
SIr 10. f f y cos x dy dx 
3. This problem concerns the double integral ffp x? dA, o Jo 
where D is the region pictured in Figure 5.42. Vict 
(a) Determine /'/,, x° dA directly by setting up and ex- 11. Í Í 3dydx 
plicitly evaluating an appropriate iterated integral. 0 J-VI-x 


(b) Now argue what the value of ff, x°dA must be = 
by inspection, that is, without resorting to explicit 12. Í l  3dx dy 
-1 J0 


calculation. 
y e 
13. / i y` dy dx 
0 —e* 


14. Figure 5.43 shows the level curves indicating the vary- 
y=4-x? ing depth (in feet) of a 25 ft by 50 ft swimming pool. 
Use a Riemann sum to estimate, to the nearest 100 fe z 

the volume of water that the pool contains. 


D 
25 
x 
20 
Figure 5.42 The region D of 
Exercise 3. 15 T 
In Exercises 4—13, evaluate the given iterated integrals. In ad- 10 ri 6 isd 
dition, sketch the regions D that are determined by the limits 
of integration. 5 i 
l l 
1 E 2 py? I fi 
a. | f 3 dy dx 5. | | ydx dy 0 
0 Jo 0 Jo 
Figure 5.43 
2 px? 3 2x+1 
6. | i ydydx 7. | / xy dy dx 15. Integrate the function f(x, y) = 1 — xy over the trian- 
0 JO —lex gular region whose vertices are (0, 0), (2, 0), (0, 2). 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
24. 
25. 
26. 


27. 


28. 


29. 


30. 


Integrate the function f(x, y) = 3xy over the region 
bounded by y = 32x? and y = \/x. 


Integrate the function f(x, y) = x + y over the region 
bounded by x + y = 2 and y? — 2y — x = 0. 
Evaluate ff, xy dA, where D is the region bounded 
by x = y? and y = x?. 


Evaluate ffp e* dA, where D is the triangular region 
with vertices (0, 0), (1, 0), and (1, 1). 


Evaluate ffp 3y dA, where D is the region bounded 
by xy? = 1, y = x, x = 0, and y = 3. 

Evaluate ff (x —2y)dA, where D is the region 
bounded by y = x? + 2 and y = 2x? — 2. 


Evaluate She? + y?) dA, where D is the region in the 
first quadrant bounded by y = x, y = 3x, and xy = 3. 


Prove property 2 of Proposition 2.7. 
Prove property 3 of Proposition 2.7. 
Prove property 4 of Proposition 2.7. 


(a) Let D be an elementary region in R?. Use the 
definition of the double integral to explain why 
J fp 1dA gives the area of D. 


(b) Use part (a) to show that the area inside a circle of 


radius a is za’. 


Use double integrals to find the area of the region 


bounded by y = x? and y = x°. 


Use double integrals to calculate the area of the region 
bounded by y = 2x, x = 0, and y = 1 — 2x — x?. 


Use double integrals to calculate the area inside 
the ellipse whose semiaxes have lengths a and b. 
(See Figure 5.44.) 


(0, b) 


(a, o) 


Figure 5.44 The ellipse of 
Exercise 29. 


(a) Set up an appropriate iterated integral to find 
the area of the region bounded by the graphs of 
y =x? — x and y = ax? for x > 0. (Take a to be 


a constant.) 


Q (b) Useacomputer algebra system to estimate for what 


value of a this area equals 1. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 
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Use double integrals to find the total area of the region 
bounded by y = x° and x = y°. 


Use double integrals to find the area of the region 
bounded by the parabola y = 2 — x’, and the lines 
x-—y=0,2x+y=0. 


Let D be the region in R? bounded by the lines x = 0, 
x +y = 3, and x — y = 3. Without resorting to any 
explicit calculation of an iterated integral, determine, 
with explanation, the value of ff (° — e? siny + 
2)dA. (Hint: Use symmetry and geometry.) 


Let D be the region in R? with y > 0 that is 
bounded by x? + y? = 9 and the line y = 0. Without 
resorting to any explicit calculation of an iterated 
integral, determine, with explanation, the value of 
Shh — y* sinx — 2) dA. 


Determine the volume ofthe solid lying under the plane 
z = 24 — 2x — 6y and over the region in the xy-plane 
bounded by y = 4 — x?, y = 4x — x?, and the y-axis. 


Find the volume under the portion of the paraboloid 
z = x? + 6y? lying over the region in the xy-plane 


bounded by y = x and y = x? — x. 


Find the volume under the plane z = 4x + 2y +25 
and over the region in the xy-plane bounded by y = 
x? — 10 and y = 31 — (x — 1). 


(a) Set up an iterated integral to compute the volume 
under the hyperbolic paraboloid z = x? — y? +5 
and over the disk 


D = {(x, y) |x? +y <4} 


in the xy-plane. 


Q (b) Use a computer algebra system to evaluate the 


39. 


40. 


integral. 
Find the volume of the region under the graph of 
f(x, y) =2— |x| — Iyl 
and above the x y-plane. 


(a) Show that if R = [a, b] x [c,d], f is continuous 
on [a, b], and g is continuous on [c, d], then 


[[ 1044 
z ( | “fdx) ( / ‘20)4)) | 


(b) What can you say about 


J| fosoa 


if D is not a rectangle? More specifically, what if 
D is an elementary region of type 1? 
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41. Let coordinates. Then to what value must this Riemann 
i: ifx is rational sum converge as both Ax; and Ay; tend to zero? 
ee eae (d) Partition R and construct a Riemann sum by choos- 
f(x,y)= 40 ifx is irrational and y < 1. ing test points ¢;; = (xj, y;) such that x7 ae 
2 if x is irrational and y > 1 if y; < | and x* is irrational if y > 1. What hap- 
5 pens to this Riemann sum as both Ax; and Ay; 

(a) Show that Jo f(x, y)dy does not depend on tend to zero? 
whether x is rational or irrational. (e) Show that f fails to be integrable on R by us- 
(b) Show that ie i, f(x, y)dy dx exists and find its ing Definition 2.3. Thus, we see that double inte- 
value. grals and iterated integrals are actually different 

(c) Partition R = [0,1] x [0,2] and construct a notions. 


Riemann sum by choosing “test points” ¢;; in each 
subrectangle of the partition to have rational x- 


5.3 Changing the Order of Integration 


Frequently, it is useful to think about the evaluation of double integrals over 
elementary regions essentially as the determination of an appropriate order of 
integration. When the region of integration is a rectangle, Fubini’s theorem 
(Theorem 2.6) says the order in which we integrate has no significance; that is, 


ff sera [fi se.navacs f f tasara 


(See Figure 5.45.) When the region is elementary of type 1 only, we must integrate 
first with respect to y (and then with respect to x) if we wish to evaluate the double 
integral by means of a single iterated integral. (See Figure 5.46.) Then 


b p8(x) 
Tf faa= | f(x, y)dy dx. 
D a y(x) 


In the same way, when the region is elementary of type 2 only, we would typically 
integrate first with respect to x, so that 


[fre -f [i f(x, y)dx dy. 


y=7(x) a 


R= [a,b] x [c, d] 


Figure 5.45 Changing the order of integration over a Figure 5.46 A type 1 region 


leads us to integrate with respect 
to y first. 


Figure 5.47 A type 2 region 
leads to integration with respect 
to x first. 


Figure 5.48 The region D of 
Example 1. 


1 


Figure 5.49 Integrating over 
the region D of Example | by 
integrating first with respect to x. 
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(See Figure 5.47.) When the region is elementary of type 3, however, we can 
choose either order of integration, at least in principle. Often, this flexibility can 
be used to advantage, as the following examples illustrate: 


EXAMPLE 1 We calculate the area of the region shown in Figure 5.48. Con- 
sidering D as a type 1 region, we obtain 


e Inx 
Area of D= |f Lda (Why?) = | / l dy dx 
D 1 Jo 
=, yl dx = | Inx dx. 
1 1 


The single definite integral that results gives the area under the graph of y = ln x 
over the x-interval [1, e], just as it should. To evaluate this integral, we need to 
use integration by parts: Let u = In x (so du = 1/x dx) and dv = dx (so v = x). 
Then 


e ë e 1 
Area of D = f Inxdx =Inx-x|; — f x-—dx 
1 1 X 
(remember f udv = u - v — f v du), so 


AraofD=e-0- | dx =e —(e—1)=1. 
1 


Integration by parts can be avoided if we integrate first with respect to x, as 
schematically suggested by Figure 5.49. Hence, 


1 e 1 1 
Area of D= | ida = f l Ldxdy = Í ray = f (e — e”)dy 
D o Je 0 0 


yy 
=(ey—e)|,=@-9)-0-2) =1, 
which checks (just as it should). + 


Note that the two iterated integrals we used to calculate the area in Example 1, 


namely, 
e Inx 1 e 
T] dydx and [ [ asas, 
1 0 0 e 


are not obtained from each other by a simple exchange ofthe limits of integration. 
The only time such an exchange is justified is when the region of integration 
is a rectangle of the form [a, b] x [c,d] so that all limits of integration are 
constants. 


EXAMPLE 2 Sometimes changing the order of integration can make an im- 
possible calculation possible. Consider the evaluation of the following iterated 


integral: 
2 p4 
/ J y cos(x?) dx dy. 
0 y2 


After some effort (and maybe some scratchwork), you should find it impossible 
even to begin this calculation. In fact it can be shown that cos(x?) does not 
have an antiderivative that can be expressed in terms of elementary functions. 
Consequently, we appear to be stuck. 
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y On the other hand, it is easy to integrate y cos(x?) with respect to y. This 
suggests finding a way to change the order of integration. We do so in two steps: 


1. Use the limits of integration in the original iterated integral to identify the 
region D in R? over which the integration takes place. (While doing this, you 
should make a wish that D turns out to be a type 3 region.) 


2. Assuming that the region D in Step 1 is of type 3, change the order of 
integration. 


The limits of integration in the preceding example imply that D can be described as 


Figure 5.50 Note that x = y? D={(x 2<x<4,0<y<2 
corresponds to y = alt over the K ? y) ly i ý h 
region shown. as suggested by Figure 5.50. Now Figure 5.50 can be used to change the order 


of integration. We have 


2 p4 4 pJx 
I J y cos(x?) dx dy = / | y cos(x?) dy dx. 
0 Jy 0 Jo 


It is now possible to complete the calculation; that is, 


4 px 47.2 
2 _ 1 2 
| Í y cos(x" )dy dx =] (3 cos(x ) 


4y > 
= = cos(x*) dx 
o 2 


1 16 
= if cosu du, 
0 


where u = x? and du = 2x dx, so that, finally, 


ya/x 
dx 


y=0 


2 p4 
| / y cos(x?) dx dy = 1 sinu|, = i sin 16. + 
0 y? 


The technique of changing the order of integration is a very powerful one, but 
it is by no means a panacea for all cumbersome (or impossible) integrals. It relies 
on an appropriate interaction between the integrals and the region of integration 
that often fails to occur in practice. 


5.3 Exercises 


N 


1. Consider the integral ae is 
A Í | (2 — x — y)dydx 
2 p2x 0 s0 
f (2x + 1)dy dx. 2 p4-2x 
0 Jx è f | ydydx 
o Jo 
(a) Evaluate this integral. 2 p4-y? 
A | / x dx dy 
o Jo 
(c) Write an equivalent iterated integral with the order 
of integration reversed. Evaluate this new integral 


9 73 
. . f | (x + y)dx dy 
and check that your answer agrees with part (a). 0 Jy 


3 e 
f / 2dydx 
0 1 


w 


A 


(b) Sketch the region of integration. 


u 


In Exercises 2-9, sketch the region of integration, reverse the 
order of integration, and evaluate both iterated integrals. 


oO 


1 2y 
7. i J e“ dx dy 
0 y 
m/2 peosx 
8. / f sin x dy dx 
0 0 
2 pa/4-y? 
9. / J ydxdy 
0 J—,/4-y? 
When you reverse the order of integration in Exercises 10 and 


11, you should obtain a sum of iterated integrals. Make the 
reversals and evaluate. 


1 =% 
10. / / (x — y)dydx 
—2 Jx?-2 


4 pdy-y? 
11. J / (y+ l)dxdy 
—1 J y—4 


In Exercises 12 and 13, rewrite the given sum of iterated in- 
tegrals as a single iterated integral by reversing the order of 
integration, and evaluate. 


1 x 2, 2-x 
12. f f sinxdydx + f | sin x dy dx 
0 Jo 1 Jo 


8 PAB 12 pVyB 
13. l f ydxdy+ Í / y dx dy 
0 0 8 /y-8 


In Exercises 14-18, evaluate the given iterated integral. 


1 3 
14. / / cos (x°) dx dy 
0 J3y 
1 1 
15. [ / x? sin xy dx dy 
0 y 
16. f / SIX dx dy 
0 y x 
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3 9—x? xe? 
17. / Í dy dx 
0 J0 9-y 
2 pl i 
18. | / e™ dxdy 
0 Jy/2 


Itis interesting to see what a computer algebra system does with 
iterated integrals that are difficult or impossible to integrate in 
the order given. In Exercises 19-21, experiment with a com- 
puter to evaluate the given integrals. 


Q 19. (a) Determine the value of lA Le y? cos (xy) dy dx 
via computer. Note how long the computer takes 
to deliver the answer. Does the computer give you 
a useful answer? 


(b) If you were to calculate the iterated integral in part 
(a) by hand, in the order it is written, what method 
of integration would you use? (Don’t actually carry 
out the evaluation, just think about how you would 
accomplish it.) 


(c) Now reverse the order of integration and let your 
computer evaluate this iterated integral. Does your 
computer supply the answer more quickly than in 
part (a)? 

Q 20. (a) See if 
3 r9 : 2 ee 
J So x sin (y^) dy dx as it is written. 

(b) Now reverse the order of integration and have 

your computer evaluate your new iterated integral. 


Which of the computations in parts (a) or (b) is 
easier for your computer? 


your computer can calculate 


9 21. (a) Can your computer evaluate h 1 n 


sin™! y 


eS*dx dy? 


(b) Reverse the order of integration and have it try 
again. What happens? 


5.4 Triple Integrals 


Let f(x, y, z) be a function of three variables. Analogous to the double integral, 
we define the triple integral of f over a solid region in space to be the limit of 
appropriate Riemann sums. We begin by defining this integral over box-shaped 
regions and then proceed to define the integral over more general solid regions. 


The Integral over a Box 


Let B be a closed box in R? whose faces are parallel to the coordinate planes. 
That is, 


B ={(x,y,z ER |a<x<b,c<y<d, p<z<q}. 


x 


Figure 5.51 The box 
B = [a,b] x [c,d] x [p,q]. 


(See Figure 5.51.) We also use the following shorthand notation for B: 
B = [a, b] x [c, d] x [p,q]. 
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a= Xo =n 
x4 
b ae 
Z1 
cC=yy y) d=y, 
y 


Figure 5.52 A partitioned box. 


DEFINITION 4.1 A partition of B of order n consists of three collections 
of partition points that break up B into a union of n> subboxes. That is, for 
i, j,k =0,...,m, we introduce the collections {x;}, {y;}, and {zg}, such that 


a) = iy Soe SK 35 Kha SH Roose SH, =O 

C—=Yo=oi => = Vj =); = = yn — Ge, 

ao ee 
(See Figure 5.52.) In addition, tonn j,k =1,...,n, let 


Axi = xi — Xii, AY; =yj— yj- and AZk= Zk — Zk. 


DEFINITION 4.2 Let f be any function defined on B = [a, b] x [c, d] x 
[p,q]. Partition B in some way. Let ¢;;, be any point in the subbox 

Bijx = Di-1, xi) x [yj-1, yj] X ei a GG olk = 1,--., 7). 
Then the quantity 


n 


S= 2 ICDA 


i, j,k=1 


where AV;jk = Ax; Ay; Az, is the volume of B;;jg, is called the Riemann 
sum of f on B corresponding to the partition. 


You can think of the Riemann sum }` f(¢;;<)AVj jx as a weighted sum of 
volumes of subboxes of B, the weighting given by the value of the function f at 
particular “test points” ¢;;, in each subbox. 


DEFINITION 4.3 The triple integral of f on B, denoted by 


oa 


by o TED or by // UC) ENGNG, 


pa Se 


Figure 5.53 The subbox 
contributes f(¢;j.)AVijx 
to the Riemann sum S. If 
we think of f as 
representing a density 
function, then the total 
mass of the entire box B 


is [ffa f aV. 


Figure 5.54 In 

Theorem 4.4 the 
discontinuities of f on B 
(shown shaded) must have 
zero volume. 
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is the limit of the Riemann sum S as the dimensions Ax;, Ay;, and Az, of 
the subboxes B;;, all approach zero, that is, 


n 
= li TAAA A 
J LAY = sre ano De SOMO AY, Aa 


Je 
provided that this limit exists. When fff f dV exists, we say that f is 
integrable on B. 


The key point to remember is that the triple integral is the limit of Riemann 
sums. It is this notion that enables useful and important applications of integrals. 
For example, if we view the integrand f as a type of generalized density function 
(“generalized” because we allow negative density!), then the Riemann sum S is 
a sum of approximate masses (densities times volumes) of subboxes of B. These 
approximations should improve as the subboxes become smaller and smaller. 
Hence, we can use the triple integral [ffa f dV, when it exists, to compute the 
total mass of a solid box B whose density varies according to f, as suggested by 
Figure 5.53. 

Analogous to Theorem 2.5, we have the following result regarding integra- 
bility of functions: 


THEOREM 4.4 If f is bounded on B and the set of discontinuities of f on B 
has zero volume, then fff f dV exists. (See Figure 5.54.) 


To say that a set X has zero volume as we do in Theorem 4.4, we mean that 
we can cover X with boxes B1, Bo,..., By, ... (ie., so that X C LJ, Bn), the 
sum of whose volumes can be made arbitrarily small. 

To evaluate a triple integral over a box, we can use a three-dimensional version 
of Fubini’s theorem. 


THEOREM 4.5 (FUBINI’S THEOREM) Let f be bounded on B = [a,b] x 
[c,d] x [p,q] and assume that the set S of discontinuities of f has zero vol- 
ume. If every line parallel to the coordinate axes meets S in at most finitely many 
points, then 


[fray 

B 
b pd q b qd d 

=f | f fæyadayar=f f f fæ yddydzdx 
a Je p a Jp de 


d b rq d q b 
=| | | tævoaaas=f f f fy ddrdzay 
c a Jp c p Ja 
q b d q d b 
=f / | f(x, y, z)dy dx dz =| / | f(x; y, z) dx dy dz. 
p va de p dc a 
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W 
| ; 
L. 
x 
Figure 5.55 The function f is 
continuous on W. 


z=W(y) 


x 


Figure 5.56 An elementary 
region of type 1. 


x= ay) x= Bly) 


Figure 5.57 The “shadow” 
(projection) of W into the x y-plane 
should be an elementary region in 
the plane. 


EXAMPLE 1 Let 
B = [—2,3] x [0,1] x [0,5], andlet f(x, y,z)= xe” + xyz. 


Thus, f is continuous and hence certainly satisfies the hypotheses of Fubini’s 


theorem. Therefore, 


3 5 
1 (x7e? + xyz)dV | | / (x7e? + xyz) dzdy dx 
B -2J0 Jo 


3 
= f f (ect bye) Zayas 


3 
= [| (5x7 + 3 xy) dy dx 
3 


=) (5x°e + Bry?) dx 


3 
= J (5(e — 1)x? + Fx)dx 


2 


= (Se -= D + FE, 


(45(e — 1) + 28) — (-2(e -1)+ 3) 


Be-1+. 


II 


You can check that integrating in any of the other five possible orders produces 
the same result. hd 


Elementary Regions in Space - 


Now suppose W denotes a fairly arbitrary solid region in space, like a rock or a 
slab of tofu. Suppose f is a continuous function defined on W, such as a mass 
density function. (See Figure 5.55.) Then the triple integral of f over W should 
give the total mass of W. As was the case with general double integrals, we need to 
find a way to properly define fff w f 4V and to calculate it in practical situations. 
The course of action is much like before: We see how to calculate integrals over 
certain types of elementary regions and treat integrals over more general regions 
by subdividing them into regions of elementary type. 


DEFINITION 4.6 We say that W is an elementary region in space if it can 
be described as a subset of R? of one of the following four types: 


Type 1 (see Figures 5.56 and 5.57) 


(a) W={(x, y,z) | 9G, y)<z5 0, y), ve) sy 55%), a<x < dt, 
or 


(b) W={@, y,z)|e@,y)<z< WG, y), ay) =x < BO), cx y < dh. 
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Type 2 (see Figure 5.58) 


(a) W —1G.),2) |e@O;2)=4 5 Py.<). V@)s yad@), D aa S 
or 
(b) WIG ye D ey 2x = 80,2), PO) S42 a VO), cay aq). 


Type 3 (see Figure 5.59) 


(a) Wo m l A N S y S00, 2) Ol = 24 = PG), ps2 = 4), 
or 
(b) W={(, y,2|v@.2 sy $5G@,2), op) S25 W(X), asx <5}. 


Type 4 
W is of all three previously described types. 


y=y(x,z) y= ô(x, z) 


Figure 5.58 For an elementary Figure 5.59 For an elementary 
region of type 2, the shadow in the region of type 3, the shadow in the 
yz-plane should be an elementary xz-plane should be an elementary 
region in the plane. region in the plane. 


Some explanation regarding Definition 4.6 is in order. An elementary region 
W of type | is a solid shape whose top and bottom boundary surfaces each can 
be described with equations that give z as functions of x and y and such that 
the projection of W into the xy-plane (the “shadow”) is in turn an elementary 
region in R? (in the sense of Definition 2.8). Similarly, an elementary region of 
type 2 is one whose front and back boundary surfaces each can be described with 
equations giving x as functions of y and z and whose projection into the yz-plane 
is an elementary region in R?. Finally, an elementary region of type 3 is one whose 
left and right boundary surfaces each can be described with equations giving y 
as functions of x and z and whose projection into the xz-plane is an elementary 
region in R°. In each case, an elementary region in space is one for which we 
can find boundary surfaces described by equations where one of the variables is 
expressed in terms of the other two, and whose “shadow” in the plane of these 
two variables is an elementary region in R? in the sense of Definition 2.8. 


EXAMPLE 2 Let W be the solid region bounded by the hemisphere x? + y? + 
z? = 4, where z < 0, and the paraboloid z = 4 — x” — y?. (See Figure 5.60.) It 
is an elementary region of type 1 since we may describe it as 


W= [eyd] -V4-P yy s2s4-27- 9%, 
—V4—x2<y<V4-x?, -2 <x <2}. 
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x24 y24 72=4, 
z<0 Shadow of W 


Figure 5.60 The solid region W of Example 2. 


This description was obtained by noting that W is bounded on top and bottom by a 
pair of surfaces, each of which is the graph of a function of the form z = g(x, y) 
and the shadow of W in the xy-plane is a disk D of radius 2, which we have 
chosen to describe as 


D= |») | -V4 =x? < y < V4- x, -2 <x <2}, 


and which we already know is an elementary region (of type 3) in the xy- 
plane. + 


EXAMPLE 3 The solid bounded by the ellipsoid 


x2 2 2 


Z sa 
E: zt = += =l, a,b,c positive constants 
a b2 ě e 


can be seen to be an elementary region of type 4. To see that it is of type 1, split 
the boundary surface in half via the z = 0 plane as shown in Figure 5.61. (This 
is accomplished analytically by solving for z in the equation for the ellipsoid.) 
Then the shadow D of E is the region inside the ellipse in the xy-plane shown in 
Figure 5.62. 


y 
x 
Figure 5.61 The ellipsoid of Figure 5.62 The shadow of 
Example 3 as an elementary the type 1 ellipsoid in 
region of type 1. Figure 5.61 is the region inside 


the ellipse x? /a? + y?/b? = 1 
in the xy-plane. 
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Z 


Zz 
y 
x x2 2 
yeno ae 
Figure 5.63 The ellipsoid of Figure 5.64 The shadow Figure 5.65 The ellipsoid of 
Example 3 as a type 2 elementary of the ellipsoid in Example 3 as a type 3 region. 
region. Figure 5.63 is the region 
inside the ellipse 
y?/b? + 27/c? = 1 in the 
yz-plane. 
We have 
x2 x2 
D= (x,y) | =b l- <y <byl-—=,-a<x<a 
a a 
y? y? 
= (x,y) | -a le axsa le ay Se? ; 
so D is in fact a type 3 elementary region in R?. 
z To see that E is of type 2, split the boundary at the x = 0 plane as in 


Figure 5.66 The 
shadow of the 
ellipsoid in 
Figure 5.65 in the 
xz-plane. 


Figure 5.63. The shadow in the yz-plane is again the region inside an ellipse. 
(See Figure 5.64.) Finally, to see that E is of type 3, split along y = 0. (See 
Figures 5.65 and 5.66.) + 


Triple Integrals in General 


Suppose W is an elementary region in R? and f is a continuous function on W. 
Then, just as in the case of double integrals, we define the extension of f by 


f(x,y,z) if(x,y,z) €W 


mena] 0 ify Ig W 


By Theorem 4.4, f°** is integrable on any box B that contains W. Thus, we can 
make the following definition: 


DEFINITION 4.7 Under the assumptions that W is an elementary region 
and f is continuous on W, we define the triple integral 


[ff sav to be Le 


where B is any box containing W. 
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Using a proof analogous to that of Theorem 2.10, we can establish the 
following: 


THEOREM 4.8 Let W be an elementary region in R? and f a continuous 
function on W. 


1. If W is of type 1 (as described in Definition 4.6), then 


b pd(x) wW(x,y) 
Tf jaya / J ihai pet 
W a Jy(x) Je(x,y) 
or 
d PBO) pwx.y) 
I fav =| J / f(x, y, z)dzdx dy. (type 1b) 
WwW c a(y) J@(x.y) 


2. If W is of type 2, then 


q 5(z) pBly,z) 
II pav=| / J endida Tepes) 
W p vy(z) Ya(y,z) 


or 
d py) fB) 

Tf sav =| J | f(x, y,z)dxdzdy. (type 2b) 
wW c Joy) Yaly,z) 


3. If W is of type 3, then 


q pB) pêz) 
Jff saw=f | | fæddi iye 
W p Ja(z) Yy(x,z) 
or 
b pw(x) pêz) 
Tf fav=Í / J f(x, y,z)dydzdx. (type 3b) 
wW a Jox) y&,z) 


EXAMPLE 4 Let W denote the (solid) tetrahedron with vertices at (0, 0, 0), 
(1, 0, 0), (0, 1, 0), and (0, 0, 1) as shown in Figure 5.67. Suppose that the mass 
density at a point (x, y, z) inside the tetrahedron varies as f(x, y,z) = 1 + xy. 
z We will use a triple integral to find the total mass of the tetrahedron. 
The total mass M is 


[ff ra= ff [0+ av. 


(See the remark before Theorem 4.4.) To evaluate this triple integral using iterated 
(1,0,0) integrals, note that we can view the tetrahedron as a type 1 elementary region. 
x (Actually, it is a type 4 region, but that will not matter.) The slanted face is given 
Figure 5.67 The tetrahedron of by the equation x + y + z = 1, which describes the plane that contains the three 
Example 4. points (1, 0, 0), (0, 1, 0), and (0, 0, 1). Hence, by first integrating with respect to 


(0, 1, 0) 


Line x+y=1 
a (in z = 0 plane) 


Figure 5.68 The shadow in the 
xy-plane of the tetrahedron of 
Example 4 is a triangular region. 


Figure 5.69 The region W of 
Example 5. 
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z and holding x and y constant, 


l—x—y 
M = / p (1+ xy)dz) dA 
shadow 0 


= Jf a +xy)\(l — x — y)dA 


=l (1 x—y+xy— xy xy’) dA. 
shadow 


The shadow of W in the x y-plane is just the triangular region shown in Figure 5.68. 
Thus, 


m=Jf (l-x-y+xy—x’y—xy*)dA 
shadow 


1 1—x 
=i (1 —x— y+ xy —x°y — xy’) dy dx 
o Jo 
1 


=, (a x)—x(l— x) i1 xy + $x(1 xy 
0 


Iria = lal x)3) dx 


Note that M can also be written as a single iterated integral, namely, 


1 1—x l—x—y 
m= f J / (1+ xy)dzdydx. é 
o Jo 0 


EXAMPLE 5 We calculate the volume of the solid W sitting in the first octant 
and bounded by the coordinate planes, the paraboloid z = x? + y? + 9, and the 
parabolic cylinder y = 4 — x”. (See Figure 5.69.) 

By definition, the triple integral is a limit of a weighted sum of volumes of 
tiny subboxes that fill out the region of integration. If the weights in the sum are 
all taken to be 1, then we obtain an approximation to the volume: 


Va S01: AVi. 


i,j,k 


Therefore, taking the limit as the dimensions of the subboxes all approach zero, 


it makes sense to define 
y= / J / lav. 
w 
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y In our situation, W is a type 1 region whose shadow in the xy-plane looks 
(0, 4) like the region shown in Figure 5.70. Thus, by Theorem 4.4, 
2 4—x? x? +y?+9 
pew vo ffp E ewe 
w o Jo 0 
2 4—x? 3 3 
P a / (x° + y" + 9) dy dx 
(2, 0) ae 
; 2 
Figure 5.70 The shadow 2 1.3 y=4—x? 
in the xy-plane of the = Í (x Pray + I ) dx 


region in Figure 5.69. 


14 — x7) + 9(4 — x°)) dx 


2 
= / (x°(4 — x?) 
0 


2 
=f (2 21x? + 3x4 ix°) dx 


_ (12 341 345_ 1,7\|2 _ 2512 
= ( x — 7x? + 5x a Vo = 35 + 


z EXAMPLE 6 Wefindthe volume inside the capsule bounded by the paraboloids 


z=9-x2?-y2 z=9-— x? -— y? andz = 3x? + 3y? — 16. (See Figure 5.71.) 
Once again, we have 
W 
v= / J / lav, 
/ Ww 


y and again the region W of interest is elementary of type 1. The shadow, or pro- 
jection, of W in the xy-plane is determined by 


{(x, y) € R? | there is some z such that (x, y, z) € W}. 


Physically, one can also imagine the shadow as the hole produced by allowing W 


z=3x2+3y2-16 to “fall through” the xy-plane. In other words, the shadow is the widest part of W 
perpendicular to the z-axis. From Figure 5.71, one can see that it is determined 


Figure 5.71 The capsule-shaped by the intersection of the two boundary paraboloids. The shadow itself is shown 
region of Example 6. in Figure 5.72. The intersection may be obtained by equating the z-coordinates 
of the boundary paraboloids. Therefore, 


i 9— x? — yt =3x°+3y?-16 => 427 44y? = 25 
x24 y2= 2 Paen x2 yi = 8 = (iP. 
= Thus, by Theorem 4.4, 
5/2 pa/25/4=x2 p9—x?—-y? 
paid O ae 


5/2 J—./25/4—x2 J 3x?+3y?—16 
Figure 5.72 The shadow / /4—x? J 3x?+3y 
of the region W of 


/ =. eee ate 
Example 6, obtained by = af f aunt [ os 
0 0 3 


dzdy dx. 


projecting the intersection 
curves of the defining 

paraboloids onto the This last iterated integral represents the volume of one quarter of the capsule. 
xy-plane. Hence, we multiply its value by 4 to obtain the total volume. The reason for this 


x?+3y?—16 
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manipulation is to make the subsequent calculations somewhat simpler (although 
the computation that follows is clearly best left to a computer). 
We compute 


5/2. a/ 25/4—x? 9—x?-y? 
v=4f | i dzdydx 
0 0 3 


x?+3y2—16 


5/2  pa/25/4—x2 
= af i (25 — 4x* — 4y°)dy dx 
0 0 


32 R 25 2\3/2 
-3/ (7x) dx. 


Now let x = 5 sind, so dx = 5 cos d0. Then 


32 [72/5 35 1250 (72 4} 
V = — — cos E cos” d0 
0 


3 2 0 
1250 (72 (1 ? 
= zii 2 
3S, (x + cos o) dé 
625 [7P 
== (1 + 2cos 20 + cos” 20) d0 
0 


625 £ 625 [7/21 
=- (6 + sin20)7? 4 Í 501 + cos 40) dé 
0 


6 
625 625 / T 6257 
= r+ (Z t0) =. + 
12 12 \2 8 
5.4 Exercises 
Evaluate the triple integrals given in Exercises 1—3. 3 I a dv 
[1,e]x[l,e]x[1,e] XYZ 
1; I xyzdV 
[-1,1]x[0,2]x{[1,3] 4. Find the value of fffyzdV, where W = [—1, 2] x 


[2,5] x [-3,3], without resorting to explicit 
2. I a? +y +z )dV calculation. 
[0, 1]x[0,2]x [0,3] 
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Evaluate the iterated integrals given in Exercises 5—7. 


5 


© 


N 


10. 


2 pz py+z 
J / f 3yz dx dy dz 
-1J1 Jo 
3 pz pxz 
i | I] (x + 2y + z)dy dx dz 
1 Jo Ji 
1 2y ytz 
Š w / zdxdzdy 
0 Ji+ydz 


(a) Let W be an elementary region in R*. Use the 
definition of the triple integral to explain why 
J Sw 14V gives the volume of W. 


(b) Use part (a) to find the volume of the region 
W bounded by the surfaces z = x? + y? and z = 
9—x?—y?, 

Use triple integrals to verify that the volume of a ball 


of radius a is 4ra? /3. 


Use triple integrals to calculate the volume ofa cone of 
radius r and height h. (You may wish to use a computer 
algebra system for the evaluation.) 


In Exercises 11—20, integrate the given function over the indi- 


cated 


11 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


region W. 


. f(x, y,z) = 2x —y+z; W is the region bounded 
by the cylinder z = y*, the xy-plane, and the planes 
x 0, x l; y 2, y 2. 


f(x, y,z)=y; W is the region bounded by the 
plane x + y+z=2, the cylinder x? +z? = 1, and 
y=0. 


f(x, y,z) = 8xyz; W is the region bounded by 
the cylinder y = x?, the plane y +z = 9, and the 
xy-plane. 


f(x, y,z) =z; W is the region in the first octant 
bounded by the cylinder y? + z? = 9 and the planes 
y=x,x =0,andz=0. 


f(x,y,z) = 1 — 27; W is the tetrahedron with vertices 
(0, 0,0), (1, 0, 0), (0, 2, 0), and (0, 0, 3). 


f(x, y, Z) = 3x; W is the region in the first octant 
bounded by z = x? + y?, x = 0, y = 0, and z = 4. 


f(x,y,z) =x+y; W is the region bounded by 
the cylinder x? +3z? =9 and the planes y= 0, 
x+y=3, 


f(x, y,z) =z; W is the region bounded by z = 0, 
x? + 4y? = 4, and z = x + 2. 


(x, y, Z) = 4x + y; W is the region bounded by x = 
2 


y,y=z,x=y,andz=0. 

f(x, y,z) =x; W is the region in the first octant 
bounded by z = x? +2y?, z=6-x? y, x=0, 
and y = 0. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


Find the volume of the solid bounded by z = 4 — x’, 
x + y = 2, and the coordinate planes. 


Find the volume of the solid bounded by the planes y = 
0,z = 0, 2y +z = 6, and the cylinder x? + y? = 9. 
Find the volume of the solid bounded by the paraboloid 
z = 4x? + y? and the cylinder y? + z = 2. 

Find the volume of the region inside both of the cylin- 
ders x? + y? = a? and x? + z? =a’. 


Consider the iterated integral 


1 1 1—x 
/ / f(x, y,z)dzdx dy. 
-1Jy Jo 


Sketch the region of integration and rewrite the inte- 
gral as an equivalent iterated integral in each of the five 
other orders of integration. 


Change the order of integration of 


1 1 x 
f | f f(x, y, z)dzdx dy 
o Jo Jo 


to give five other equivalent iterated integrals. 


Change the order of integration of 


[ ff tes oecavas 


to give five other equivalent iterated integrals. 


Consider the iterated integral 


2 p3V36—9x? p36—4x?—4y? 
f | f 2dzdy dx. 
0 0 5x2 


(a) This integral is equal to a triple integral over a solid 
region W in R°. Describe W. 


(b) Setup an equivalent iterated integral by integrating 
first with respect to z, then with respect to x, then 
with respect to y. Do not evaluate your answer. 

(c) Setup an equivalent iterated integral by integrating 
first with respect to y, then with respect to z, then 
with respect to x. Do not evaluate your answer. 

(d) Now consider integrating first with respect to y, 
then x, then z. Set up a sum of iterated integrals 
that, when evaluated, give the same result. Do not 
evaluate your answer. 

(e) Repeat part (d) for integration first with respect to 
x, then z, then y. 


Consider the iterated integral 


2 påv4=x pay? 
/ f | (x3 + y?)dzdy dx. 
—2 J0 x? +3y? 


(a) This integral is equal to a triple integral over a solid 
region W in R°. Describe W. 
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(b) Set up an equivalent iterated integral by integrating (d) Now consider integrating first with respect to x, 
first with respect to z, then with respect to x, then then y, then z. Set up a sum of iterated integrals 
with respect to y. Do not evaluate your answer. that, when evaluated, give the same result. Do not 

(c) Setup an equivalent iterated integral by integrating evaluate your answer. 
first with respect to x, then with respect to z, then (e) Repeat part (d) for integration first with respect to 
with respect to y. Do not evaluate your answer. y, then z, then x. 


5.5 Change of Variables 


As some of the examples in the previous sections suggest, the evaluation of a mul- 
tiple integral by means of iterated integrals can be a complicated process. Both the 
integrand and the region of integration can contribute computational difficulties. 
Our goal for this section is to see ways in which changes in coordinates can be 
used to transform iterated integrals into ones that are relatively straightforward to 
calculate. We begin by studying the coordinate transformations themselves and 
how such transformations affect the relevant integrals. 


Coordinate Transformations 


Let T: R? —> R? be a map of class C! that transforms the uv-plane into the xy- 
plane. We are interested particularly in how certain subsets D* of the uv-plane 
are distorted under T into subsets D of the x y-plane. (See Figure 5.73.) 


v y 
D* D=T(D*) 


Figure 5.73 The transformation T(u, v) = (x(u, v), y(u, v)) 
takes the subset D* in the uv-plane to the subset D = {(x, y) | 
(x, y) = T(u, v) for some (u, v) € D*} of the xy-plane. 


EXAMPLE 1 Let T(u, v) = (u + 1, v + 2); that is, let x = u + 1, y = v +2. 
This transformation translates the origin in the uv-plane to the point (1, 2) in the 
xy-plane and shifts all other points accordingly. The unit square D* = [0, 1] x 
[0, 1], for example, is shifted one unit to the right and two units up but is otherwise 
unchanged, as shown in Figure 5.74. Thus, the image of D* is D = [1, 2] x [2, 3]. 

+ 
EXAMPLE 2 Let S(u, v) = (2u, 3v). The origin is left fixed, but S stretches all 
other points by a factor of two in the horizontal direction and by a factor of three 
in the vertical direction. (See Figure 5.75.) + 


EXAMPLE 3 Composing the transformations in Examples 1 and 2, we obtain 
(T o S)(u, v) = T(2u, 3v) = (Qu + 1, 3v + 2). 


Such a transformation must both stretch and translate as shown in Figure 5.76. +% 
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v y 
ga 
T 
2T DITO) 
1 D* it 
a: CES l 
1 1 2 3 


Figure 5.74 The image of D* = [0, 1] x [0, 1] is 
D = [1, 2] x [2, 3] under the translation 
T(u, v) = (u + 1, v + 2) of Example 1. 


x 


D = S(D*) = [0, 2] x [0, 3] 


Figure 5.75 The transformation S of Example 2 is a scaling 
by a factor of 2 in the horizontal direction and 3 in the vertical 


direction. 
y 
5+ 
v 44 
cae 
Tos 
oe FA sd 
D=([1, 3] x [2, 5] 
p 
‘ll = ER 
u t t t— X 
1 2 3 


Figure 5.76 Composition of the transformations of 
Examples 1 and 2. 


EXAMPLE 4 Let T(u, v) = (u + v, u — v). Because each of the component 
functions of T involves both variables u and v, it is less obvious how the unit 
square D* = [0, 1] x [0, 1] transforms. We can begin to get some idea of the 
geometry by seeing how T maps the edges of D*: 


Bottom edge: (u,0), O<u<1; T(u,0)= (u, u); 
Top edge: (u,1) O<u<1; Tw,l=(ust+l,u—-1); 
Left edge: (0, v), O<uv<1; T(0,v)= (v, —v); 
Right edge: (1, v) O<v<1; T(l,v)=(v+1,1-— v). 


By sketching the images of the edges, it is now plausible that the image of D* 
under T is as shown in Figure 5.77. + 


Figure 5.77 The transformation T of Example 4. 
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More generally, we consider linear transformations T: R? —> R? defined 
by 


TU, v) = (aut bv eu + dv)=| 4 ala 


v 


where a, b, c, and d are constants. (Note: The vector (u, v) is identified with the 


2 x 1 matrix 7 .) One general result is stated in the following proposition: 


PROPOSITION 5.1 Let A = | : ; | , where det A # 0. If T: R? > R? is de- 
fined by 


Ta =A] | 


then T is one-one, onto, takes parallelograms to parallelograms and the vertices 
of parallelograms to vertices. (See §2.1 to review the notions of one-one and onto 
functions.) Moreover, if D* is a parallelogram in the uv-plane that is mapped 
onto the parallelogram D = T(D*) in the xy-plane, then 


Area of D = | det A| - (Area of D*). 


EXAMPLE 5 We may write the transformation T(u, v) = (u + v, u — v) in 


Example 4 as 
1 1 u 


l 1 
det| | _| |=-2#0. 


Note that 


Hence, Proposition 5.1 tells us that the square D* = [0, 1] x [0, 1] must be 
mapped to a parallelogram D = T(D*) whose vertices are 


T(0,0)= (0,0), T0,1)=(1,-1, TA,0=(1,1) TA,1)=(2,0). 


Therefore, Figure 5.77 is indeed correct and, in view of Proposition 5.1, could have 
been arrived at quite quickly. Also note that the area of D is |— 2|- 1=2. + 


Proof of Proposition 5.1 First we show that T is one-one. So suppose 
T(u, v) = Tu’, v’). We show that then u = u’, v = v’. We have 


T(u, v) = Ti’, v’) 
if and only if 
(au + bu, cu + dv) = (au' + bv’, cu’! + du’). 
By equating components and manipulating, we see this is equivalent to the system 


a(lu— u’) + b(v — v) =0 


c(u — u’) +d(v — v) =0 (1) 


352 Chapter 5 | Multiple Integration 


Figure 5.78 The vertices of 

D* ={p+sa+tb|0<s,t< 
1} are at p, p + a, p + b, 

p +a + b (i.e., where s and t take 


on the values 0 or 1). 


Aa 


Figure 5.79 The image D of the 
parallelogram D* under the linear 
transformation T(u) = Au. 


If a Æ 0, then we may use the first equation to solve for u — u’: 


b 
u —u' = —-(v— v’) (2) 
a 
Hence, the second equation in (1) becomes 
b 
f(y —v) +d(v—v) =0 
a 
or, equivalently, 
—b d 
eG, — v’) =0. 


a 


By hypothesis, det A = ad — bc # 0. Thus, we must have v — v’ = 0 and, there- 
fore, u — u' = 0 by equation (2). If a = 0, then we must have both b Æ 0 and 
c Æ 0, since det A 4 0. Consequently, the system (1) becomes 


b(v — v’) =0 
c(u— u’) +d(v — v) =0 ` 


The first equation implies v — v’ = 0 and, hence, the second becomes c(u — u’) = 
0, which in turn implies u — u' = 0, as desired. 

To see that T is onto, we must show that, given any point (x, y) € R?, we can 
find (u, v) € R? such that T(u, v) = (x, y). This is equivalent to solving the pair 
of equations 


au+bv=x 
| cu+dv=y 
for u and v. We leave it to you to check that 
_ dx — by 
ad — bc 


ee) en: 
~ ad — be 


v 


will work. 
Now, let D* be a parallelogram in the uv-plane. (See Figure 5.78.) Then D* 
may be described as 


D* = {u|u = p+ sa+tb, 0<s <1, 0<t <I}. 


Hence, 
D = T(D*) = {Au | u € D} 
= {A(pt+sat+tb)|0<s<1,0<r<}} 


= {Ap+sAa+tAb|0<s<1,0<t<1)}. 


If we let p’ — Ap, a’ = Aa, and b’ = Ab, then 
D={p +sa +tb]0<s<1,0<t <1} 


Thus, D is also a parallelogram and, moreover, the vertices of D correspond to 
those of D*. (See Figure 5.79.) 
Finally, note that the area of the parallelogram D* whose sides are parallel to 


a= [| and bafa | 
a2 bz 
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may be computed as follows: 


i j k 
Area of D* = |a x b|| = | det] a) a 0 = |aibz — anh |. 
b b 0 


Similarly, the area of D = T(D*) whose sides are parallel to 
f b! 
a’ = K and b’=| ! 
ay b, 


Area of D = ||a' x b’|| = |a; b, — ab) |. 
Now, a’ = Aa and b’ = Ab. Therefore, 


a} _{[a_ b a, |__| aai + ba 
a’, cfc d ma | | ca+da |’ 
bo | [a b][ bh |_| a+b, 
b | |c d|| b| | cbi+db | 


Hence, by appropriate substitution and algebra, 
Area of D = |(aaı + baz)(cb; + db2) — (ca, + daz)(abı + bb2)| 


= |(ad — bc)(a,b2 — a2b1)| 
= | det A| - area of D*. 


and 


Note that we have not precluded the possibility of D*’s being a “degenerate” 
parallelogram, that is, such that the adjacent sides are represented by vectors a 
and b, where b is a scalar multiple of a. When this happens, D will also be 
a degenerate parallelogram. The assumption that det A ~ 0 guarantees that a 
nondegenerate parallelogram D* will be transformed into another nondegenerate 
parallelogram, although we have not proved this fact. E 


Essentially all of the preceding comments can be adapted to the three- 
dimensional case. We omit the formalism and, instead, briefly discuss an example. 


EXAMPLE 6 Let T:R? — R? be given by 
T(u, v, w) = (2u, 2u + 3v + w, 3w). 


Then we rewrite T by using matrix multiplication: 


20 0 u 
T(u, v, w)=| 2 3 1 v 
003 
Note that if 
200 
A=|]23 1], 
0 0 3 


then det A = 18 Æ 0. 
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A result analogous to Proposition 5.1 allows us to conclude that T is one-one 
and onto, and T maps parallelepipeds to parallelepipeds. In particular, the unit 
cube 


D* = [0, 1] x [0, 1] x [0, 1] 


is mapped onto some parallelepiped D = T(D*) and, moreover, the volume of D 
must be 


| det A| - volume of D* = 18-1= 18. 

To determine D, we need only determine the images of the vertices of the cube: 
T(0, 0,0) = (0,0,0); TCU, 0,0) = (2,2,0); TO, 1,0) = (0, 3, 0); 
T(0,0, 1) = (0, 1,3); Td,1,0) = (2,5,0); TQ, 0, 1) = (2, 3, 3); 
T(0, 1,1) = (0,4, 3); TC, 1,1) = (2, 6, 3). 


Both D* and its image D are shown in Figure 5.80. + 


Figure 5.80 The cube D* and its image D under the 
linear transformation of Example 6. 


EXAMPLE 7 Ofcourse, not all transformations are linear ones. Consider 
(x, y) = T(r, 0) = (r cos 9, r sin 0). 


Note that T is not one-one since T(0, 0) = (0, 0) = T(0, x). (Indeed T(0, 6) = 
(0, 0) for all real numbers 0.) Note that vertical lines in the r@-plane given by 
r =a, where a is constant, are mapped to the points (x, y) = (a cos 0, a sin 0) 
on a circle of radius a. Horizontal rays {(r, 0) | 0 =a,r > 0} are mapped to 
rays emanating from the origin. (See Figure 5.81.) It follows that the rectangle 
DF = [5 1] x [0, x] in the r6-plane is mapped not to a parallelogram, but bent 


0 y 
r=a Image of 
0 = 
0=a T ys 
1m 

r x 
Image of 
r=a 


Figure 5.81 The images of lines in the r@-plane under the 
transformation T(r, 0) = (r cos 0, r sin 0). 
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0 y 
m 
T 
1 
D=T(D") z 
r x 

1 1 

2: r 
Figure 5.82 The image of the rectangle D* = [, 1] x [0, x] Figure 5.83 The image of B* = [5. 1] x [0, x] x [0, 1] 


under T(r, 0) = (r cos 0, r sin 0). 


under T(r, 0, z) = (r cos 8, r sin@, z). 


into a region D that is part of the annular region between circles of radii 5 and 1, 
as shown in Figure 5.82. 
Analogously, the transformation T: R? —> R? given by 


(x, y,z) = T(r, 6, z) = (r cos, r sin 0, z) 


bends the solid box B* = [5. 1] x [0, x] x [0, 1] into a horseshoe-shaped solid. 
(See Figure 5.83.) + 


Change of Variables in Definite Integrals 


Now we see what effect a coordinate transformation can have on integrals and how 
to take advantage of such an effect. To begin, consider a case with which you are 
already familiar, namely, the method of substitution in single-variable integrals. 


EXAMPLE 8 Consider the definite integral R 2x cos(x?) dx. To evaluate, one 
typically makes the substitution u = x? (so du = 2x dx). Doing so, we have 
u=4 

= sin 4. 


u= 


2 4 
| 2x cos(x?) dx = / cosudu = sinu 
0 0 
Let’s dissect this example more carefully. First of all, the substitution u = x? 


may be rewritten (restricting x to nonnegative values only) as x = y/u. Then 
dx = du/(2./u) and 


2 4 4 
d 
| 2x cos(x? )dx = / 2/u cos( vm? = | cosu du = sin 4. 
0 0 2/u 0 


In other words, the substitution is such that the 2x = 2./u factor in the integrand 
is canceled by the functional part of the differential dx = du/(2./u). Hence, a 
simple integral results. + 


In general, the method of substitution works as follows: Given a (perhaps 
complicated) definite integral f a f(x)dx, make the substitution x = x(u), where 
x is of class C!. Thus, dx = x'(u) du. If A = x(a), B = x(b), and x'(u) 0 for 
u between a and b, then 


B b 
/ ore J Flx(u))x!(u) du. 8) 
A a 
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Figure 5.84 As Au = du — 0, 

Ax — dx = x'(u) Au. Thus, the factor 
x’(u) measures how length in the w-direction 
relates to length in the x-direction. 


Note that it is possible to have a > b in (3) above. (This happens if x(u) is 
decreasing.) Although the u-integral in equation (3) may at first appear to be more 
complicated than the x-integral, Example 8 shows that in fact just the opposite 
can be true. 

Beyond the algebraic formalism of one-variable substitution in equation (3), 
it is worth noting that the term x'(u) represents the “infinitesimal length distortion 
factor” involved in the changing from measurement in u to measurement in x. 
(See Figure 5.84.) We next attempt to understand how these ideas may be adapted 
to the case of multiple integrals. 


The Change of Variables Theorem for Double Integrals —— 


Suppose we have a differentiable coordinate transformation from the uv-plane to 
the xy-plane. That is, 


T:R? > R?, Tu, v) = (x(u, v), y(u, v)). 


DEFINITION 5.2 The Jacobian of the transformation T, denoted 


d(x, y) 
O(u, v)’ 
is the determinant of the derivative matrix DT(u, v). That is, 
ox Ox 
a(x, ax ə ax ð 
a) evan dec) oe ee 
(u, v) dy ay du ðv dv ðu 
ðu av 


The notation d(x, y)/d(u, v) for the Jacobian is a historical convenience. The 
Jacobian is not a partial derivative, but rather the determinant of the matrix of 
partial derivatives. It plays the role of an “infinitesimal area distortion factor” 
when changing variables in double integrals, as in the following key result: 


x 
(8, 0) 


Figure 5.85 The triangular region 
D of Example 9. 
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THEOREM 5.3 (CHANGE OF VARIABLES IN DOUBLE INTEGRALS) Let D and D* 
be elementary regions in (respectively) the xy-plane and the uv-plane. Suppose 
T: R? = R? is a coordinate transformation of class C! that maps D* onto D 
in a one-one fashion. If f: D — R is any integrable function and we use the 
transformation T to make the substitution x = x(u, v), y = y(u, v), then 


a(x, y) 
If, f(x, y)dxdy = TT. f(x(u, v), yan op |e dudv. 


EXAMPLE 9 We use Theorem 5.3 to calculate the integral 


Tf cos(x + 2y) sin(x — y)dx dy 
D 


over the triangular region D bounded by the lines y = 0, y = x,andx + 2y = 8as 
shown in Figure 5.85. It is possible to evaluate this integral by using the relatively 
straightforward methods of §5.2. However, this would prove to be cumbersome, 
so, instead, we find a suitable transformation of variables, motivated in this case by 
the nature of the integrand. In particular, we let u = x + 2y, v = x — y. Solving 
for x and y, we obtain 


u + 2v u—v 
and y= i 
3 3 


x= 


Therefore, 


OGD) dee Xu Xv | det 
a(u, v) Yu Yv 


Wo] w= 
Wile 


Considering the coordinate transformation as a mapping T(u, v) = (x, y) of 
the plane, we need to identify a region D* that T maps in a one-one fashion 
onto D. To do this, essentially all we need do is to consider the boundaries of D: 


Hence, one can see that T transforms the region D* shown in Figure 5.86 onto 
D. Therefore, applying Theorem 5.3, 


Figure 5.86 The effect of the transformation T of Example 9. 
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d(x, y) 


Tf cos(x +2y)sin( = )dxdy = | f cos u sin v 
D * d(u, v) 
-|/ cosu sin v |—4|dudv 
D* 
8 u 
=] / + cos u sin v dv du 
0 Jo 
8 


Í 1 cos u (— cos v)|)—5 du 


du dv 


II 


8 
= if cos u(— cosu + 1)du 
0 
8 
= if (cos u — cos? u) du 
0 


8 
= [sinu -f 4a + cos 2u) du | 
0 


=4 [sing — (Su + + sin2u)|5| 


= {[sin8 — 4 — }sin 16]. 


There is another, faster way to calculate the Jacobian, namely, to calculate 
d(u, v)/d(x, y) directly from the variable transformation, and then to take reci- 
procals. That is, from the equations u = x + 2y, v = x — y, we have 


a(u, v) = Ux Uy | _ 1 2) _ 
TE = det | de Vy | = darf 1-1 | = —3. 


Consequently, d(x, y)/d(u, v) = —i, which checks with our previous result. 


This method works because if T(u, v) = (x, y), then, under the assumptions of 
Theorem 5.3, (u, v) = T~!(x, y). It follows from the chain rule that 


DT! (x, y) = [DT(u, v)J"!. 


(That is, DT~! is the inverse matrix of DT. See Exercises 30-38 in §1.6 for more 
about inverse matrices.) Hence, 


a(x, y) 
oa(u, v) 


= det (DT | = det [(DT)'] = PETG 
+ 


EXAMPLE 10 We use Theorem 5.3 to evaluate f f (x? — y’) e® dx dy, where 
D is the region in the first quadrant bounded by the hyperbolas xy = 1, xy = 4 
and the lines y = x, y = x + 2. (See Figure 5.87.) 


2 
I 4” 
Figure 5.88 The region D* 


corresponding to the region D of 
Example 10. 
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y y=x+2 


Figure 5.87 The region D of Example 10. 


Both the integrand and the region present complications for evaluation. There 
would seem to be two natural choices for ways to transform the variables. One 
would be 


u=x°— y? and v=xy, 
motivated by the nature of the integrand. However, the region D of integration 
will not be easy to describe in terms of this particular choice of uv-coordinates. 
Another possible transformation of variables, motivated instead by the shape of 
D, is 


u=xy and v=y-x. 


Now this change of variables would not seem to help much with the integrand, 
but, as we shall see, it turns out to be just what we need. 

First note that the boundary hyperbolas xy = 1 and xy = 4 correspond, re- 
spectively, to the lines u = 1 andu = 4; the lines y = x and y = x + 2 correspond 
to v = 0 and v = 2. Thus, the region D* in the uv-plane that corresponds to D 
(see Figure 5.88) is 


D* = {(u,v)|1<u<4, 0<v <2}. 


Next, we calculate that the Jacobian of the variable transformation is 


haan = det] = i | =x+y. 
Hence, the Jacobian we require in order to use Theorem 5.3 is 
əx, y) sl 
(u,v) x+y 


Moreover, since we will be working in the first quadrant (where x and y are both 
positive), |3(x, y)/d(u, v)| = 1/Œx + y). 
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At last we are ready to compute: 


[fe — ye dxdy= | | œ - ye (— -) uav 


-f f * (x — y)(x + y) e dudv 
x+y 
-f [ —ve" dudv 
2 
-f —v(et—e!)\dv= e e')| = Xe — eô). 
0 0 


Note that the insertion of the Jacobian in the integrand caused precisely the can- 
celation needed to make the evaluation straightforward. We cannot always expect 
this to happen, but the lesson here is to be willing to carry through calculations 
that may not at first appear to be so easy. + 


EXAMPLE 11 (Double integrals in polar coordinates) In Example 9, a coor- 
dinate transformation was chosen primarily to simplify the integrand of the double 
integral. In this example we change variables by using a coordinate system better 
suited to the geometry of the region of integration. 

For example, suppose that the region D is a disk of radius a: 


= {(x, y) |x’? +y? <a} 
= f@, y) -v=x <y < ax? -a <x <a}. 


Then, to integrate any (integrable) function f over D in Cartesian coordinates, 
one would write 


Tf fle. y)dxdy= | i E 
D —a J —Ja2—x?2 


Even if it is easy initially to find a partial antiderivative of the integrand, the limits 
in the preceding double integral may complicate matters considerably. This is be- 
cause the disk is described rather awkwardly by Cartesian coordinates. We know, 
however, that it has a much more convenient description in polar coordinates as 


{(r,0)|0<r<a,0<6 <27}. 
This suggests that we make the change of variables 
(x, y) = T(r, 0) = (r cos 0, r sind), 


which is shown in Figure 5.89. (Note that T maps all points of the form (0, @) to 
the origin in the xy-plane and, thus, cannot map D* in a one-one fashion onto D. 
Nonetheless, the points of D* on which T fails to be one-one fill out a portion 
of a line—a one-dimensional locus—and it turns out that it will not affect the 
double integral transformation.) The Jacobian for this change of variables is 


d(x, y) -d ee —r sin@ 


= 2 z ER 
a(r,0) sinô rcos@ | =rcos 0 +rsmw 0 =r. 


Figure 5.90 The disk of 
radius a centered at the 
origin. 


is 


(0,2 
) x?+y?=4 


(2, 0) 


Figure 5.91 The region 
D of Example 13. 


5.5 | Change of Variables 361 


Figure 5.89 T maps the (nonclosed) rectangle D* to 
the disk D of radius a. 


(Note that r > 0 on D, so |r| = r.) Thus, using Theorem 5.3, the double integral 
can be evaluated by using polar coordinates as follows: 


[f revan- f [Oo roa 


27 a 
=| / f(rcos@,rsin@)rdr dé. 
0 Jo 


It is evident that the limits of integration of the r@-integrals are substantially 
simpler than those in the x y-integral. Of course, the substitution in the integrand 
may result ina more complicated expression, but in many situations this will not be 
the case. Polar coordinate transformations will prove to be especially convenient 
when dealing with regions whose boundaries are parts of circles. + 


EXAMPLE 12 To see polar coordinates “in action,” we calculate the area of a 
circle, using double integrals. Once more, let D be the disk of radius a, centered 
at the origin as in Figure 5.90. Then we have 


a Va2—x? 27 a 
Area = |f \aa= f J dydx = | Í rdr d0, 
D —a J—Ja?—x? 0 0 


following the discussion in Example 11. The last iterated integral is readily eval- 
uated as 


2x pa 27 
/ f r dr d0 = / (472[,)40 = [07 a? d0 = $a°(2x — 0) = xa, 
0 0 0 


which indeed agrees with what we already know. If you feel so inclined, 
compare this calculation with the evaluation of the iterated integral in Cartesian 
coordinates. No doubt you’ll agree that the use of polar coordinates offers clear 
advantages. + 


EXAMPLE 13 We evaluate the double integral ffp Vx? +y? + 1dxdy, 
where D is the quarter disk shown in Figure 5.91, using polar coordinates. The 
region D of integration is given in Cartesian coordinates by 


D={(x,y)|0<y<vV4—x?, 0<x<2}, 


so that 


4—x? 


E 
I Vry ds dy = | Vx? + y? +l1dydx. 
D o Jo 
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This iterated integral is extremely difficult to evaluate. However, D corresponds 
to the polar region 


D* = (0.0) |0 sr <2, 0< 0 = x/2}. 


Therefore, using Theorem 5.3, we have 


Tf vty + idxay= |f Vr2cos20 +r?sin?0 + 1-rdr do 
D D* 


m/2 p2 

=f / yr? +1rdrdé 
0 0 
m/2 

-f Lr? + 1) do 
m/2 

-f 7(5*/? — 1)d0 


T 
=F ad): ‘ 
Sketch of a proof of Theorem 5.3 Let (uo, vo) be any point in D* and let 
Au = u — ug, Av = v — vo. The coordinate transformation T maps the rectangle 
R* inside D* (shown in Figure 5.92) onto the region R inside D in the x y-plane. 
(In general, R will not be a rectangle.) Since T is of class C!, the differentiability 
of T (see Definition 3.8 of Chapter 2) implies that the linear approximation 


h(u, v) = T(uo, vo) + DT (uo, vo) | “o | 
v — Vo 


= T(uo, vo) T DT(uo, vo) | | 


-~ R=T(R’) 


T(uo, vo) 


bE e 


= 


Figure 5.92 T takes a rectangle R* inside D* to a region R inside D. 


is a good approximation to T near the point (uo, vo). In particular, h takes the rect- 
angle R* onto some parallelogram P that approximates R as shown in Figure 5.93. 
We compare the area of R* to that of P. 

From Figure 5.93, we see that the rectangle R* is spanned by 


5.5 | Change of Variables 363 


y 

b=|° R i a. mI b 

H | (uo, vo) 
oa 


P=h(R’) 


R=T(R’) 


(up; Uo) Au T(uo, vo) = 


a= h(up, vo) DT (up, voja 


Figure 5.93 The linear approximation h takes the rectangle R* onto a parallelogram P 
that approximates R = T(R*). 


and the parallelogram P is spanned by the vectors c = DT(uo, vo)a and d = 
DT(uo, vo)b. Hence, 


Area of R* = |la x b|| = Au Av, 


and thus, by Proposition 5.1, 


d(x, y) 


Au Av. 
d(u, v) — 


Area of P = ||c x d|| = | det DT(uo, vo)| Au Av = | (ug, vo) 


This result gives us some idea how the Jacobian factor arises. 

To complete the sketch of the proof, we need a partitioning argument. Partition 
D* by subrectangles R;;. Then we obtain a corresponding partition of D into (not 
necessarily rectangular) subregions Rj; = T(R}, ). Let AA;; denote the area of Rij. 
Let ¢;; denote the lower left corner of R7; and let dj; = T(¢;;). (See Figure 5.94.) 
Then, since f is integrable on D, 


f| to. varay = im ESADA 


From the remarks in the preceding paragraph, we know that 


d(x, y) 
AA;j X area of parallelogram h(R;;) = | 3u, yy eid) Au; Avj. 
v y 


Figure 5.94 A partition of D* gives rise to a partition of D. 
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0 
y. 
x=r cos 0 
y=rsin 0 4 
dy 1 
dx D 
x 
r x 
Figure 5.95 The “area element” Figure 5.96 The polar-rectangular transformation takes rectangles in the 
dA in rectangular coordinates is r-plane to wedges of disks in the xy-plane. 


dx dy. 


Arclength = rA@ 


oe PP ih toe 


Figure 5.97 An infinitesimal 
polar wedge. 


Taking limits as all the R;; tend to zero (i.e., as Au; and Av; approach zero), we 
find that 


3 
J| tœ »aray = lim df(t (ci) = a E A e) 


Au; Av; 
Au;,Avj>0 


a(x, 2 


dudv, 


II 


II. fou), 96, vy Rees 


as was to be shown. i 


Consider again the polar-rectangular coordinate transformation. When we use 
Cartesian (rectangular) coordinates to calculate a double integral over a region 
D in the plane, then we are subdividing D into “infinitesimal” rectangles having 
“area” equal to dx dy. (See Figure 5.95.) On the other hand, when we use polar 
coordinates to describe this same region, we are subdividing D into infinitesimal 
pieces of disks instead. (See Figure 5.96.) These disk wedges arise from trans- 
formed rectangles in the r@-plane. One such infinitesimal wedge in the x y-plane 
is suggested by Figure 5.97. When A@ and Ar are very small, the shape is nearly 
rectangular with approximate area (r A@) Ar. Thus, in the limit, we frequently say 


dA=dxdy (Cartesian area element) 


=rdrd0 (polar area element). 


Change of Variables in Triple Integrals 


It is not difficult to adapt the previous reasoning to the case of triple integrals. We 
omit the details, stating only the main results instead. 


DEFINITION 5.4 Let T: R? —> R? be a differentiable coordinate transfor- 
mation 


Tu, v, w) = (x(u, v, w), yu, v, w), Z(u, v, w)) 
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from uvw-space to xyz-space. The Jacobian of T, denoted 


a(x, y, Z) 
d(u, v, w)’ 
is det(DT(u, v, w)). That is, 
| 2 Ox ax | 
du dv ow 
we act] oy fy ay 
alu, v, w) ðu ðv ðw 


| dz az az | 

du ðv ðw 

In general, given any differentiable coordinate transformation T: R” —> R”, 
the Jacobian is just the determinant of the derivative matrix: 


ðxı əx əðxı 
Ou; ðu OUn 
ax ax 0X2 
ð EO Dek ia a 
ieee) ee Geer eden) ol Ow dun 
O(Uy, +++, Un) ke & 
Gin z OXn 
ðu duz dün 


THEOREM 5.5 (CHANGE OF VARIABLES IN TRIPLE INTEGRALS) Let W and W* be 
elementary regions in (respectively) xyz-space and uvw-space, and let T: R? > 
R? be a coordinate transformation of class C! that maps W* onto W in a one-one 
fashion. If f: W — R is integrable and we use the transformation T to make the 
substitution x = x(u, v, w), y = y(u, v, w), z = z(u, v, w), then 


Tf a ne ene 


=I f(x(u, v, w), y(u, v, w), z(u, v, E e ôx, y, 2 dudvdw. 


(See Figure 5.98.) 


In the integral formula of the change of variables theorem (Theorem 5.5), the 
Jacobian represents the “volume distortion factor” that occurs when the three- 
dimensional region W is subdivided into pieces that are transformed boxes in 
uvwc-space. (See Figure 5.99.) In other words, the differential volume elements 


366 Chapter 5 | Multiple Integration 


w* er 


u x 


Figure 5.98 A three-dimensional transformation T that takes the solid 
region W* in uvw-space to the region W in x yz-space. 


(i.e., “infinitesimal” pieces of volumes) in xyz- and uvw-coordinates are related 
by the formula 


a(x, y, z) 


——_| dudvdw. 
d(u, v, w) ia 


dV = dx dy dz =| 


wD, 


Figure 5.99 The volume of the “infinitesimal box” in 
uvw-space is du dv dw. The image of this box under T 
has volume |d(x, y, z)/d(u, v, w)| du dv dw. 


EXAIVIPLE 14 (Triple integrals in cylindrical coordinates) When integrat- 
ing over solid objects possessing an axis of rotational symmetry, cylindrical 
coordinates can be especially helpful. The cylindrical-rectangular coordinate 


transformation 
x =rcosé 
y=rsing 
Z =z 


has Jacobian 
a | cos@ —rsind 0 
LIE ae sind rcos@ 0 | =rcos 8 +r sin? 0 =r. 
a(r, 0, z) 0 0 1 


Hence, the formula in Theorem 5.5 becomes 


Tf fæ y ddzdydz= | f f(r cos0,r sind, z)r dr dé dz. 
w we 


5.5 | Change of Variables 367 


In particular, we see that the volume element in cylindrical coordinates is 
dV =r dr d0 dz. 


(Recall that the cylindrical coordinate r is usually taken to be nonnegative. Given 
this convention, we may omit the absolute value sign in the change of variables 
formula.) The geometry behind this volume element is quite plausible: A “dif- 
ferential box” in r@z-space is transformed to a portion of a solid cylinder that is 


nearly a box itself. (See Figure 5.100.) + 
z z 
rdð _ dr 
a D 
dz dz DE 
dr i 1] | 
toii | 
0 a 
Se i, 
r dr <0 


Figure 5.100 A “differential box” in r@z-space is mapped to a portion of a 
solid cylinder in x yz-space by the cylindrical—rectangular transformation. 


EXAMPLE 15 To calculate the volume of a cone of height h and radius a, we 
may use Cartesian coordinates, in which case the cone is the solid W bounded by 


the surface az = h,/x? + y? and the plane z = h, as shown in Figure 5.101. The 
volume can be found by anes the iterated triple integral 


[ a = _dedydx. 
—a Va?—=x? t J+ 


We will forgo the details of the evaluation, noting only that trigonometric substi- 
tutions are necessary and that they make the resulting computation quite tedious. 


z 
z=h 


— > 
= 
N 
+ 
<< 
te 
ll 
a 
N 


az=hvx? + y* 


Shadow in xy-plane 


Figure 5.101 The solid cone W of Example 15. 


In contrast, since the cone has an axis of rotational symmetry, the use of 
cylindrical coordinates should afford us substantially less involved calculations. 
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Figure 5.102 The cone of 
Example 15 described in 
cylindrical coordinates. 


Hence, we consider the cone again. (See Figure 5.102.) Note that 


h 
w= {roa —-r<z<sh,0<r<a, 0<0 <27). 
a 
Thus, the volume is given by 


20 a h 
eat av= | | 1 rdzdr dð. 
Ww 0 Jo Jtr 


(Note the order of integration that we chose.) The evaluation of this iterated 
integral is exceedingly straightforward; we have 


20 a h 2n a h 
/ | / rdzdrao = Í f s(n- 2h) aras 
0 0 Jtr 0 0 a 


Qn h h r=a 
= | (Fr — =") dé 
0 2 3a rÜ 
20 h h 
= Í (fe — ta?) do 
0 2 3 
h 
=2n (20°) = —a’h, 
6 
which agrees with what we already know. + 


EXAMPLE 16 (Triple integrals in spherical coordinates) Ifa solid object has 
a center of symmetry, then spherical coordinates can make integration over such 
an object more convenient. The spherical-rectangular coordinate transformation 


x = p sin g cos 0 
y = psing sind 
Zz = p cCOs o 

has Jacobian 


sing cos pcosgcosd —psing sind 
A(x, y, z) 


= det| singsinO  pcoso sinô psing cosé 
alp, p, 9) 


cos o —p sing 0 
Using cofactor expansion about the last row, this determinant is equal to 
cosg (p? cos’ 0 sing cosy + p° sin’ 6 sing cos g) 
+ psing (p cos’ 0 sin’ o + p sin’ 6 sin? g) 
= p° cosy(sing cosy) + p° sin’ o 
= p’ sing (cos g + sin’ g) 
= p sing. 


(Under the restriction that 0 < g < z, sing will always be nonnegative. Hence, 
the Jacobian will also be nonnegative.) Therefore, the volume element in spherical 


Figure 5.104 The ball B of 
radius a of Example 17. 


Figure 5.105 The cone of 
Example 18 described in 
spherical coordinates. 
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0 Zz 
dé /, 
P pdo 
do rd = psing d0 
~E e 
p x 


Figure 5.103 A differential box in pp0-space is mapped to a portion of a solid ball 
in xyz-space by the spherical—rectangular transformation. 


coordinates is 
dV = p° sing dp dg dé, 


and the change of variables formula in Theorem 5.5 becomes 


Ii E 


~ ITI. F(x(P, p, 0), YP, p, 0), z(P, p, 8)) o? sing dp dg dé. 


The volume element in spherical coordinates makes sense geometrically, because 
a differential box in py6-space is transformed to a portion of a solid ball that is 
approximated by a box having volume p? sing dp dg d0. (See Figure 5.103.) + 


EXAMPLE 17 The volume of a ball is easy to calculate in spherical coordi- 
nates. A solid ball of radius a may be described as 


B={(0,9,9)|0<p<a,0<@<7,0<6 < 27}. 


(See Figure 5.104.) Hence, we may compute the volume by using the triple integral 


27 x a 27 T a 
ITTI av= | | / sinp dododo = | / — sing dọ d0 
B o Jo Jo o Jo 3 


a 20 a 20 
= a (— cos gj) dð = — (—(—1)+ 1)d0 
3 0 3 0 


2a? [7 4ra? 


0 


? 


as expected. + 


EXAMPLE 18 We return to the example of the cone of radius a and height h 
and, this time, use spherical coordinates to calculate its volume. First, note two 
things: (i) that the cone’s lateral surface has the equation g = tan~!(a/h) in spher- 
ical coordinates and (ii) that the planar top having Cartesian equation z = h has 
spherical equation p cos g = h or, equivalently, o = h sec g. (See Figure 5.105.) 
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For fixed values of the spherical angles g and 0, the values of p that give 
points inside the cone vary from 0 to h sec y. Any points inside the cone must 
have spherical angle g between 0 and tan~!(a/h). Finally, by symmetry, 6 can 
assume any value between 0 and 2x . Hence, the cone may be described as the set 


{(0.0.0) |05 p< hseew, 0< <tn! E, 0<0 <n], 


Therefore, we calculate the volume as 


2n tan—!(a/h) phsecy 
f / f p’ sing dp do d0 
0 0 0 


2n tan—!(a/h) h 3 
= i Vee ag) sing dy d0 
0 Jo 3 
h? 27 tan`! (a/h) 
= a 1 sec? y sing dy dé 
3 Jo Jo 


h3 20 tan—!(a/h) 
= =f | tang sec? g dọ do. 


Now, let u = tany so du = sec? y dg. Then the last integral becomes 


h? 2m falh B 271 (a 2 ha? Qn 
= dud} = — =|{-—] d0 = — do 
3 Í | = a dy. 2 (5) 6h? Jo 


2h 
= (2x0) = Sah, 


as expected. + 


The use of spherical coordinates in Example 18 is not the most appropri- 
ate. We merely include the example so that you can develop some facility with 
“thinking spherically.” Further practice can be obtained by considering some of 
the applications in the next section as well as, of course, some of the exercises. 


Summary: Change of Variables Formulas 


Change of variables in double integrals: 


N A(x, y) 
If iC, Maxay = ie f(xy, v), y(u, o) a J) du dv 


Area elements: 


dA E= dxdy (Cartesian) 
= Paid? (polar) 


ð ’ 
-| Gy) dudv_ (general) 


d(u, v) 


i pear eye 


Change of variables in triple integrals: 
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ð 
=| f(x(u, v, w), y(u, v, w), zu, v, w)) Hi 0y. z dudvdw 
w* 
Volume elements: 
dV =dxdydz (Cartesian) 

=rdrd0dz (cylindrical) 

= p sing dp dg d0 (spherical) 
0 b kJ 

= e dudvdw (general) 
d(u, v, w) 

5.5 Exercises 
1. Let T(u, v) = Bu, —v). 6. Suppose T(u, v) = (u, uv). Explain (perhaps by us- 


(a) Write T(u, v) as A | A | for a suitable matrix A. 


(b) Describe the image D = T(D*), where D* is the 7. 
unit square [0, 1] x [0, 1]. 


2. (a) Let 


Tu, v) (“ —v u+ r) 
u, v) = ,—}. 
J2° V2 
How does T transform the unit square D* = 
[0, 1] x [0, 1]? 
(b) Now suppose 


utv —) 8. 
2: «ff 


Describe how T transforms D*. 


wor (UE 


and D* is the parallelogram whose vertices are (0, 0), 
(1, 3), (-1, 2), and (0, 5), determine D = T(D*). 


Tu, v) = ( 


4. If D* is the parallelogram whose vertices are (0, 0), 
(—1, 3), (1, 2), and (0, 5) and D is the parallelogram 9. 
whose vertices are (0, 0), (3,2), (1, —1), and (4, 1), 
find a transformation T such that T(D*) = D 


5. If T(u, v, w) = (3u — v, u — v + 2w, 5u + 3v — w), 
describe how T transforms the unit cube W* = 
[0, 1] x [0, 1] x [0, 1]. 


ing pictures) how T transforms the unit square D* = 
[0, 1] x [0, 1]. Is T one-one on D*? 


Let T: R? —> R? be the transformation given by 
T(p, 9, 9) = (p sing cos9, p sing sind, p cos o). 


(a) Determine D = T(D*), where D* = [0, 1] x 
[0, x] x [0, 27]. 

(b) Determine D = T(D*), where D* = [0, 1] x 
[0, 2/2] x [0, 2/2]. 

(c) Determine D = T(D*), where D* = [1/2, 1] x 
(0, 2/2] x [0, 2/2]. 


This problem concerns the iterated integral 


(y/2)+2 
[ Í, (2x — y)dx dy. 


(a) Evaluate this integral and sketch the region D of 
integration in the xy-plane. 


(b) Let u = 2x — y and v = y. Find the region D* in 
the uv-plane that corresponds to D. 


(c) Use the change of variables theorem (Theorem 5.3) 
to evaluate the integral by using the substitution 
u=2x—y,v=y. 


Evaluate the integral 


(x/2)+1 5 
[ L (2y — xe dy dx 


by making the substitution u = x, v = 2y — x. 
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10. 


11. 


12. 
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Determine the value of 


x+y dA 
x— 2y 


where D is the region in R? enclosed by the lines 
y=x/2,y=0,andx+y= 1. 

Evaluate ff (2x + y)’e*~» dA, where D is the region 
enclosed by 2x +y = 1, 2x+y=4, x-y=-l, 
andx —y= 1. 


2 -37 
Tf Gis) ) dx dy, 
p Qy—x +6? 
where D is the square with vertices (0,0), (2, 1), 


(3, —1), and (1, —2). (Hint: First sketch D and find 
the equations of its sides.) 


Evaluate 


In Exercises 13—17, transform the given integral in Cartesian 
coordinates to one in polar coordinates and evaluate the polar 
integral. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


[ pu 3 dy dx 
vI=x2 
2 V4—x2 
f f dy dx 
0 0 


I (x? + y?)/? dA, where D is the disk x? + y? < 9 
D 


a p= 
l Í e D dx dy 
—a J0 
[ a dy dx 
o Jo yx? +y? 


Evaluate 


1 
—— dA 
J 


where D is the disk of radius 1 with center at (0, 1). 
(Be careful when you describe D.) 


Let D be the region between the square with vertices 
(1, 1),(—1, 1), (—1,—1), (1, —1)and the unit disk cen- 


tered at the origin. Evaluate / / y dA. 
D 


Find the total area enclosed inside the rose r = sin 20. 
(Hint: Sketch the curve and find the area inside a single 
leaf.) 


Let n be a positive integer, and let a be a posi- 
tive constant. Calculate the total area inside the rose 
r = acosné and show that the value depends only on 
a and whether n is even or odd. 


22. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


Find the area of the region inside both of the circles 
r = 2acos@ and r = 2asin@, where a is a positive 
constant. 


. Find the area of the region inside the cardioid r = 


1 — cos 0 and outside the circle r = 1. 


Find the area of the region bounded by the positive 
x-axis and the spiral r = 30,0 < 8 < 27. 


Tf cos(x? + yydA, 
D 


where D is the shaded region in Figure 5.106. 


Evaluate 


Arc of a circle 
of radius 1 
(centered at 


origin) 
` 


2 E Bx 


x 


Figure 5.106 The region D of 
Exercise 25. 


Evaluate / sin (x? + y? ) dA, where D is the region 


D 
in the first quadrant bounded by the coordinate axes 
and the circles x? + y? = 1 and x? + y? = 9. 


X 
Use polar coordinates to evaluate | i ——— dA, 
D / x2 +4 y2 


where D is the unit square [0, 1] x [0, 1]. 


SIX? 
Evaluate f —_ T ——— dz dy dx by 
VIZ J axy? Eoee 


using cylindrical coordinates. 


1 /1—y2 4-x2-y? a 
Evaluate J / f e* 1 ¥: dz dx dy by 
=l J—,/1-y? JO 


using cylindrical coordinates. 


Evaluate 


Laas 
BYP +ytet3 


where B is the ball of radius 2 centered at the origin. 


[fe +y? +227)dV, 


where W is the solid cylinder defined by the inequali- 
ties x? + y? <4,-l<z<2. 


Determine 


32. 


33. 


Zz 
Determine the value of J / f —— dV, where 
w yx? ++ y? 


W is the solid region bounded by the plane z = 12 and 
the paraboloid z = 2x? + 2y? — 6. 


Find the volume of the region W bounded on top by 


z = ya? — x? — y?, on the bottom by the xy-plane, 
and on the sides by the cylinder x? + y? = b?, where 
0<b<a. 


In Exercises 34 and 35, determine the values of the given in- 
tegrals, where W is the region bounded by the two spheres 
xX +y +z? =a and x? + y? +z = b, foro <a <b. 


34. 


35. 


36. 


37. 


Jih rr 

w yx? Hy? Hz? 

JII "ITE ar ay 
Ww 


Let W denote the solid region in the first octant be- 
tween the spheres x? + y? + z? = a? and x? + y? + 
z? = b°, where 0 <a <b. Determine the value of 
Sffy@+y+z)dVv. 


Determine the value of J. f i w z?dV, where W is the 


solid region lying above the cone z = y 3x? + 3y? and 
inside the sphere x? + y? + 2? = 6z. 


38. 


39. 


40. 


41. 


42. 
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Determine 


SJS +v») dv, 


where W = f(x, y, z) | yx? +y? < z/2 < 3}. 


Find the volume of the region W that represents the 
intersection of the solid cylinder x? + y? < 1 and the 
solid ellipsoid 2(x* + y?) + z? < 10. 


Find the volume of the solid W that is bounded by 
the paraboloid z = 9 — x? — y?, the xy-plane, and the 
cylinder x? + y? = 4. 


Find 
IT (2+x7+y*)dV, 
WwW 


where W is the region inside the sphere x? + y? + z? = 
25 and above the plane z = 3. 


Find the volume of the intersection of the three solid 
cylinders 


2 


ray <<, 3 


VF au, 


and y? + 2a". 
(Hint: First draw a careful sketch, then note that, by 
symmetry, it suffices to calculate the volume of a por- 
tion of the intersection.) 


5.6 Applications of Integration 


In this section, we explore a variety of settings where double and triple integrals 


arise naturally. 


Sunday 47 


Day JF 
Monday 65 
Tuesday 63 


Wednesday | 52 
Thursday 51 
Friday 45 
Saturday 43 


temperature: 


Average temperature = 


Average Value of a Function 


Suppose temperatures (shown in the adjacent table) are recorded in Oberlin, Ohio, 
during a particular week. From these data, we calculate the average (or mean) 


65 + 63 +52 +51 +45 +43 +47 


zx 52.3°F. 


7 


Of course, this calculation only represents an approximation of the true average 
value, since the temperature will vary during each day. To determine the true 
average temperature, we need to know the temperature as a function of time for 
all instants of time during that one-week period; that is, we consider 


Temperature = T(x), 


x = elapsed time (in days), for 


0<x<7. 


Then a more accurate determination of the average temperature is as an integral: 


7 
Average temperature = + Í T(x)dx. (1) 
0 


Since an integral is nothing more than the limit of a sum, it’s not hard to see that 
the preceding formula is a generalization of the original discrete sum calculation 
to the continuous case. (See Figure 5.107.) 
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Days 
Figure 5.107 A continuous temperature function T(x) 
over the interval [0, 7]. The average temperature for the 
week is + A T(x)dx. 
Note that 


4 
7= / dx = length of time interval. 
0 


Hence, we may rewrite formula (1) as 


fi T(x) dx . 


Average temperature = 7 
h dx 


This observation leads us to make the following definitions concerning average 
values of functions. 


DEFINITION 6.1 (a) Let f:[a,b]— R be an integrable function of one 
variable. The average (mean) value of f on [a, b] is 
io f J? TO fÉ f@œ)dx 
[f= z f sod 2S = 
= dhe iP dx length of interval [a, b] 


(b) Let f: D C R? > R be an integrable function of two variables. The 
average value of f on D is 


_ JIo faA _ Sfo fda 
"e [pA amd D 


(c) Let f: W C R? = R be an integrable function of three variables. The 
average value of f on W is 


_SSIwfdV _ SSSw fay 


= Gay ~ volume of W` 


Lf] 


[avg 


EXAMPLE 1 Suppose that the “temperature function” for Oberlin during a 
week in April is 
T(x) = H x7 — 10776 4 112755 _ 2393 4 4 66821 y3 _ 4578142 4 12581 y 4 65, 


5040 180% T T80% 72 720 360 F 39 * 7 


x+2y=2 


(2.0) > 


Figure 5.108 The triangular 
metal plate of Example 2. 
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where 0) < x < 7. Then the mean temperature for that week would be 


J 
Li 113 7 _ 10756 1127,5 _ 2393,4 
[T avg = tf, ( sean 180% T Tso ~ 72 * 


| 66821,3 _ 45781 „2 4 12581, 
r “799 * 360 ¥ T 310 + 65) dx 
_ 1( 113 %8 _ 107 %7} 112746 _ 2393,5 
= 7 (3020 1260% T T080 360 
| 66821,4 _ 45781,3 4 12581,2 7 
F 3880 1080 ~ T “420 + 65x) lo 
_ 888709 ~, Š 
= 88870 ~ 51,43°F, ‘ 


EXAMPLE 2 Suppose that the thickness of the triangular metal plate, shown 
in Figure 5.108, varies as f(x, y) = xy + 1, where (x, y) are the coordinates of 
a point in the plate. The average thickness of the plate is, therefore, 


RR = oxy + 1) dx dy 


Average thickness = 
fags 2-2y d dy 


Note that 
2—2y 
/ i dx dy = area of triangular plate = 5(2 -l=1 
0 
from elementary geometry. Hence, the average thickness is 


9—Jy 1 
hh @y+Ddxdy _ 12 x=2—2y 
Lat f (ayta) dy 


1 
= (E2 - 2yyy + (2 — 2y)) dy 


EXAMPLE 3 (See also Example 6 of $5.4.) Suppose the temperature inside the 
capsule bounded by the paraboloids z = 9 — x? — y? and z = 3x7 + 3y? — 16 
varies from point to point as 


T(x, y, z) = zx? + y’). 


We calculate the mean temperature of the capsule. 
From Definition 6.1, 


SIJ T av 
Wf av 


The particular iterated integrals we can use for the computation are then 
5/2 a/25/4—x? x? 9—x?—y? 
/ — J z(x? + y*)dzdy dx 
—5/2 J—4/25/4—x? J3x24+3y2—16 


5/2 a/25/4—x? x2 9—x?—y? 
EIE eon 
—5/2 /25/4—x2 x? J 3x?+3y?—16 


[T avg = 


[T avg = 
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Unfortunately, the calculations involved in evaluating these integrals are rather 
tedious. 

On the other hand, since the capsule has an axis of rotational symmetry, 
cylindrical coordinates can be used to simplify the computations. Note that the 
boundary paraboloids have cylindrical equations ofz = 9 — r? and z = 3r? — 16 
and that the shadow of the capsule in the z = 0 plane can be described in polar 
coordinates as 


{(7,0)|0<r<3,0<0 <27}. 
(See Figures 5.109 and 5.110.) 


{(r,0) |0<r<3,0<@<2z} 


y 
y 
x 
x 
z= 3x? + 3y?-16 
or 

z=3r?-16 
Figure 5.109 The capsule of Figure 5.110 The shadow of 
Example 3. the capsule in Figure 5.109 


in the z = 0 plane. 


In addition, the temperature function may be described in cylindrical coordi- 
nates as 


TŒ, y, z) =z? +y?) = zr’. 
Hence, we may calculate 


2m 5/2 p9—r? 2 
tos Jsdager order d0 
[T lave = 2 ! : 


Qn 5/2 p9—r2 
Io A 32-167 dzdr d0 


For the denominator integral, 


2m p5/2 p9—r? 2m p5/2 
/ i J rdedrao = Í | r (9 — r°) — Br? — 16)) dr dO 
0 0 3r2-16 0 0 
2n p5/2 
= Í (25r — 4r°) dr d0 
0 0 


27 25 5/2 
= / (Fr = r) dé 
0 2 0 
20 
625 625 625 625 
=f — — — do =ar a r, 
0 8 16 16 8 


My mı 


2 ^ * 


0 


Figure 5.111 This seesaw 
balances if mıxı + m2x2 = 0. 
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This result agrees with the volume calculation in Example 6 of §5.4, as it should. 
For the numerator integral, we compute 


z=9—r? 


2m p5/2 p9—r? 2n p5/2 z2 
/ | | zr? dzdr d0 = / | 9) dr dé 
o Jo J3r2—-16 o Jo 2 z=3r2—16 


2m 5/2 73 
= | | (9 — r?°? — Gr’ — 16)’) dr d0 
o Jo 2 


2m p5/2 13 
= | | (—8r4 + 78r? — 175) dr dO 
o Jo 2 


1 f27 [52 
=; J (—8r’ + 78r° — 175r°) dr dO 
0 0 
5/2 
do 


1 a 175 
= =f ( r8 + 13r° r) 
2 Jo 4 0 


1 (27 15625 15625 
= d0 = T: 
0 


2 256 256 
Thus, 
— 15625x /256 25 
6250/8 32 


[T ]avg = 


Center of Mass: The Discrete Case 


Consider a uniform seesaw with two masses mı and m2 placed on either end. If 
we introduce a coordinate system so that the fulcrum of the seesaw is placed at 
the origin, then the situation looks something like that shown in Figure 5.111. 
Note that x2 < 0 < xı. The seesaw balances if 


mıxı + mx = Q. 


In this case, the center of mass (or “balance point”) of the system is at the origin. 

But now suppose mıxı + m2x2 Æ 0. Then where is the balance point? Let us 
denote the coordinate of the balance point by x. Before we find it, we’ll introduce 
a little terminology. The product m;x; (in this case, for i = 1,2) of mass and 
position is called the moment of the ith body with respect to the origin of the 
coordinate system. The sum mıxı + m2x2 is called the total moment with respect 
to the origin. To find the center of mass, we use the following physical principle, 
which tells us that a system of several point masses is physically equivalent (in 
terms of moments) to a system with a single point mass. 


Guiding physical principle. The center of mass is the point such that, if all 
the mass of the system were concentrated there, the total moment of the new 
system would be the same as that for the original system. 


Putting this principle into practice in our situation, we see that total mass M 
of our system is mı + m2. If x is the center of mass, then the guiding principle 
tells that 


Mx = mx; + mx2. 
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m,m M3 ay m, 1m, — Thatis, the total moment of the new (concentrated) system is the same as the total 
e. è> è =) ee | ius 
XiX X3 0 Xni Xn moment of the original system. Hence, 
mıxı + mx 
Figure 5.112 A system of n e 
masses distributed on a line. mı T m2 


If we have a system of n masses distributed along a (coordinatized) line, then 
the same reasoning may be applied. (See Figure 5.112.) We have 


y 
i ” total moment mıxı + Mx +*+ Mr Xn oy 
n le = — 
Yxn Yn) (x1, y1) total mass mı +m +---+m 
xX 
M 
(X2, Yo) ; F 
Now we move to two and three dimensions. Suppose, first, that we have n 
Ms saad particles (or bodies) arranged in the plane as in Figure 5.113. Then there are two 
pya moments to consider: 
n 
Figure 5.113 A system ofn Total moment with respect to the y-axis = 5 Mixi, 
masses in R°. i=l 


and 


n 
Total moment with respect to the x-axis = X Miyi. 
i=l 
(Admittedly, this terminology may seem confusing at first. The idea is that the 
moment measures how the system balances with respect to the coordinate axes. 
It is the x-coordinate—not the y-coordinate—that measures position relative to 
the y-axis. Similarly, the y-coordinate measures position relative to the x-axis.) 
The guiding principle tells us that the center of mass is the point (x, y) such that, 
if all the mass of the system were concentrated there, then the new system would 
have the same total moments as the original system. That is, if M = }` m;, then 


n 
Mx = ) MiXi 
i=l 


(i.e., the moment with respect to the y-axis of the new system equals the moment 
with respect to the y-axis of the original system) and 


My = X miyi. 
i=] 


Thus, we have shown the following: 


Discrete center of mass in R?. Given a system of n point masses mı, 
m2, .. ., Mn at positions 

(x1, yı), (x2, Wahooor (Con Yn) in R’, 
the coordinates (x, y) of the center of mass are 

n n 
Aa Ba) MiX a m 
z= dist Game and j= dist ii (3) 
Dai l Paaa Mi 


e 
mM; 


e 
(Xi, Vis Zi) 


x 


Figure 5.114 A discrete system 
of masses in R°. 
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For particles arranged in three dimensions, little more is needed than adding 
an additional coordinate. (See Figure 5.114.) 


Discrete center of mass in R*. Given a system of n point masses mı, 
m2, ..., Mn at positions 


(Ca. yi, Z (x2, y2, fo otes (Gas Yn; Zn) in R$, 
the coordinates (x, y, Z) of the center of mass are given by 
n n n 
= doin MiXi - doin Mii 3 is 
<= eS See, eee (4) 
D Mi iat Mi Vie mi 


The numerators of the fractions in (4) are the moments with respect to the 
coordinate planes. Thus, for example, the sum Zi mixi is the total moment 
with respect to the yz-plane. 

By definition, moments of physical systems are additive. That is, the total 
moment of a system is the sum of the moments of its constituent pieces. However, 
it is by no means the case that a coordinate of the center of mass of a system is 
the sum of the coordinates of the centers of mass of its pieces. This additivity 
property makes the study of moments important in its own right. 


Center of Mass: The Continuous Case 


Now, we turn our attention to physical systems where mass is distributed in 
a continuous fashion throughout the system rather than at only finitely many 
isolated points. 

To begin with the one-dimensional case, suppose we have a straight wire 
placed on a coordinate axis between points x =a and x =b as shown in 
Figure 5.115. Moreover, suppose that the mass of this wire is distributed according 
to some continuous density function 5(x). We seek the coordinate x that represents 
the center of mass, or “balance point,” of the wire. 


=] E 
A=Xy Xp X e i 


x 
| 

x 

= 

= 
i~ 
=> 


Figure 5.115 A “coordinatized” wire. 
The mass of the segment between x;_; and 
x; is approximately 5(x;)Ax;. 


Imagine breaking the wire into n small pieces. Since the density is continuous, 
it will be nearly constant on each small piece. Thus, fori = 1, ...,, the mass m; 
of each piece is approximately ê(x¥)Ax;, where Ax; = x; — x;_, is the length of 
each segment of wire, and x¥ is any number in the subinterval [x;—1, x;]. Hence, 
the total mass is 


M= > a x Sse) Ani, 
i=l 


i=l 
and the total moment with respect to the origin is approximately 


n 


* x 
XP ô(xž)Axi. 
j—] approx. approx. 


position mass 
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Figure 5.116 A lamina 
depicted as a region D in 
the xy-plane with density 
function 6. 


Of course, these results can be used to provide an approximation of the coordi- 
nate x of the center of mass. For an exact result, however, we let all the pieces 
of wire become “infinitesimally small”; that is, we take limits of the foregoing 
approximating sums as all the Ax;’s tend to zero. Such limits give us integrals, 
and we may reasonably define our terms as follows: 


Continuous center of mass in R. For a wire located along the x-axis 
between x = a and x = b with continuous density per unit length 5(x): 


b 
Total mass = f d(x) dx. 


a 


b 
Total moment = i x 5(x) dx. (5) 


total moment _ ie xd(x)dx 


total mass I? 3(@)dx 


Center of mass x = 


Compare the formulas in (3) with those in (5). Instead of a sum of masses 
and a sum of products of mass and position, we have an integral of “infinitesimal 
mass” (the 6(x) dx term) and an integral of infinitesimal mass times position. 


EXAMPLE 4 Suppose that a wire is located between x = —1 and x = 1 along 
a coordinate line and has density 6(x) = x? + 1. Using the formulas in (5), we 
compute that the center of mass has coordinate 


x= = = 


fiixG2+ dx (ixt+ie) 0 
Ca e i} 


This makes sense, since this wire has a symmetric density pattern with respect to 
the origin (i.e., d(x) = ê(—x)). + 


The analogous situation in two dimensions is that of a lamina or flat plate of 
finite extent and continuously varying density 5(x, y). (See Figure 5.116.) Using 
reasoning similar to that used to obtain the formulas in (5), we make the following 
definition for the coordinates (x, y) of the center of mass of the lamina: 


Continuous center of mass in R?. Fora lamina represented by the region 
D in the xy-plane with continuous density per unit area d(x, y): 


total moment with respect to y-axis ffpx d(x, y)dA_ 
total mass E MCa y)dA ° 


(6) 
total moment with respect to x-axis _ Sfp y (x, y)dA 
total mass ff 8, yA © 


y= 
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Roughly, the term 5(x, y)dA represents the mass of an “infinitesimal two-dimen- 
sional” piece of the lamina and the various double integrals the limiting sums of 
such masses or their corresponding moments. 


EXAMPLE 5 We wish to find the center of mass of a lamina represented by the 
region D in R* whose boundary consists of portions of the parabola y = x? and 
the line y = 9 and whose density varies as 6(x, y) = x? + y. (See Figure 5.117.) 

First, note that this lamina is symmetric with respect to the y-axis and that, in 
P addition, the density function has a similar symmetry because ô(x, y) = 6(—x, y). 
We may conclude from these two observations that the center of mass must occur 
along the y-axis (i.e., that x = 0). Using the formulas in (6) and noting that the 
lamina is represented by an elementary region of type 1, 


y 
(-3,9) (3,9) 


3 9 2 
Figure 5.117 The region i= Tp ¥ sŒ, y)dA _ Jos f Wx" + y)dy dx 


D representing the lamina Sfo ôx, y)dA f, Pa + y)dy dx l 
of Example 5. B 


For the denominator integral, we compute 


3 9 3 1 y= 
| | (x7 + y)dydx = J (x?) + 3”) 
—3 Jx? -3 2 y 


For the numerator, 


3 p9 I eee ae 3\ p 
J | yo? + y)dydx = f aa 
3x2 3 \ 2 37|; 


2 3 11664 
= 2 H 243x a x! = as 
2 42 s T 
Hence, 
11664/7 45 
y= paca = — 7 6.43. 
1296/5 7 


This answer is quite plausible, since the density of the lamina increases with y, 
and so we should expect the center of mass to be closer to y = 9 than to y = 0. 
+ 
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x 


Figure 5.118 The tetrahedron of 
Example 6. 


We may modify the two-dimensional formulas to produce three-dimensional 


ones. 


Continuous center of mass in R’. Given a solid W whose density per 
unit volume varies continuously as ô(x, y, z), we compute the coordinates 
(x, y, Z) of the center of mass of W using the following quotients of triple 
integrals: 


total moment with respect to yz-plane ff fy x 5(x, y, 2) dV 


x 


total mass © Pea aay 
_ _ total moment with respect to xz-plane _ fffy y d(x, y,z)dV_ (7) 
cas total mass isa aay 
_ total moment with respect toxy-plane fffyzê@, y, z)dV 
= = 


total mass Tit say za 


In (7) we may think of the term 5(x, y, z)dV as representing the mass of an 
“infinitesimal three-dimensional” piece of W. Then the triple integrals are the 
limiting sums of masses or moments of such pieces. 


EXAMPLE 6 Consider the solid tetrahedron W with vertices at (0, 0, 0), 
(3, 0, 0), (0, 3, 0), and (0, 0, 3). Suppose the mass density at the point (x, y, z) 
inside the tetrahedron is ô(x, y, z) = x + y +z + 1. We calculate the resulting 
center of mass. (See Figure 5.118.) 

First, note that the position of the tetrahedron in space and the density function 
are both such that the roles of x, y, and z may be interchanged freely. Hence, the 
coordinates (x, y, Z) of the center of mass must satisfy x = y = Z. Therefore, we 
may reduce the number of calculations required. 

The tetrahedron is a type 4 elementary region in space. Thus, we may calculate 
the total mass M of W, using the following iterated integral: 


3 3-x 3-—x-y 
u= | | / (x+y+z+ 1l)dzdydx 
o Jo 0 


z=3-—x-y 


-f [7 (w+y+02+5) 7 


3 3=x 
z| 1 (5 X 5x? y—xy ły?) dy dx 
o Jo 
3 


dydx 


= | -r-H r dx 


3 
_ 27 15 ype x2 — 117 
=i (3 gi tate )dx=y. 


The total moment with respect to the x y-plane is given by 


3 3-x 3-—x-y 
| / | zx + y+z+ l)dzdydx 
o Jo 0 


x 


Figure 5.119 The cone of 
Example 7. 
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z=3-—x-y 


$55 2 p 
-| i (w+y+n5+5) dy dx 
0 v0 2 3 z=0 
3 P= 157 45 1 
, 12 
=] | (=n tF 
o Jo 


+ xy ix y4 sy | xy | iy?) dy dx 


Hence, 


51 
- = = _ 40 

=5=z7= ÍL = hate, 

=== m 65 $ 


Ifan object is uniform, in the sense that it has constant density, then one uses 
the term centroid to refer to the center of mass of that object. Suppose the object 
is a solid region W in R°. Then, if the density 6 is a constant k, the equations for 
the coordinates (x, y, Z) may be deduced from those in (7). For the x-coordinate, 
we have 


SÍS xy, dV fffykxdV 
Sff Œy DdV  ff{fykav 


SSfyx dV 1 Tf] dV 
= = X è 
Sffy 4V volume of W W 
Similarly, 


1 1 
y= — dV d z= —__ dV. 
Á volume of W If oo volume of W [ff 


In particular, the constant density 5 plays no role in the calculation of the 
centroid, only the geometry of W. (Note: Completely analogous statements can 
be made in the case of centroids of laminas in R?.) 


X= 


EXAMPLE 7 We compute the centroid of a cone of radius a and height h. (See 
Figure 5.119.) 

By symmetry, x = y = 0. Moreover, we know that the volume of the cone is 
(1/3)a7h. Thus, the z-coordinate of the centroid is 


3 
z= dV. 
= fy? 


This triple integral is most readily evaluated by using cylindrical coordinates. 
(See Example 15 of §5.5.) The lateral surface of the cone is given by z = hy, so 
we calculate 


3 27 a h 3 rah? 3 
Z = —— dzdr dð = = —h 
z wan | I fie = (xn) ( 4 4 


after a straightforward evaluation. Hence, the centroid of the cone is located at 
(0, 0, åh). + 
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Figure 5.120 The box of 
Example 8. 


Moments of Inertia 


Let W be a rigid solid body in space. As we have seen, the moment integral 
with respect to the xy-plane is Myy = ff fy 2 5(x, y, z)dV—that is, the inte- 
gral of the product of the position relative to a reference plane (in this case the 
xy-plane) and the density of the solid. This integral can be considered to mea- 
sure the ease with which W can be displaced perpendicularly from the reference 
plane. 

Now, consider spinning W about a fixed axis (which may or may not pass 
through W). The moment of inertia / (or second moment—the moment integral 
mentioned in the preceding paragraph is sometimes called the first moment) is 
a measure of the ease with which W can be made to spin about the given axis. 
Specifically, / is the integral of the product of the density at a point in W and the 
square of the distance from that point to a fixed axis; that is, 


I = [|| Ps¢y. av, (8) 


where d is the distance from (x, y, z) € W to the specified axis. 
When the axes of rotation are the coordinate axes in R?, we have 


I, = moment of inertia about the x-axis = Tf O rA mes 
w 

I, = moment of inertia about the y-axis = o CERES aV. 
w 


I, = moment of inertia about the z-axis = i Gy Gaya. 
w 


EXAMPLE 8 Let W be a solid box of uniform density ê and dimensions a, b, 

and c. If W is situated symmetrically with respect to the coordinate axes as shown 

in Figure 5.120, we compute the moments of inertia with respect to these axes. 
Note, first, that W may be described as 


w=kh )| © = ail b 2? co, E 
Ta ne Ge ry, ee gg | 


Hence, the moment of inertia about the x-axis is 


c/2 b/2 a/2 c/2 b/2 
t= J O? + 2°) ô dx dy dz = O? + 27) ady dz 
—c/2 J —b/2 J —a/2 —c/2 J—b/2 
y=b/2 


c/2 y? ef2 b3 
= 6a | ($+) dz = ŝa Í (+02) dz 
—c/2 3 y=—b/2 —c/2 12 


be be dabc 
=ô | = b? | 2 x 
i i” 72 ) vee 


Top: z=4 


x 


Figure 5.121 The solid W of 
Example 9. 


Cone: z =2 Vx? +y? 


y 
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By permuting the roles of x, y, and z (and the corresponding constants a, b, and 
c), we see that 


dab 
(a@ +) and L= a +b’). 


dabc 
lL = —— 
: 12 


Therefore, if a > b > c (as in Figure 5.120), it follows that I. < I, < I,. This 
result may be confirmed by the observation that rotations about the axis parallel 
to the longest side of the box are easiest to effect in that the same torque applied 
about each axis will cause the most rapid rotation to occur about the axis through 
the longest dimension. A related fact is regularly exploited by figure skaters who 
pull their arms in close to their bodies, thereby reducing their moments of inertia 
and speeding up their spins. + 


EXAMPLE 9 Let W be the solid bounded by the cone z = 2,/x? + y? and the 
plane z = 4 shown in Figure 5.121. Assume that the density of material inside W 
varies as ô(x, y, z) = 5 — z. Let us calculate the moment of inertia 7, about the 
z-axis. 

Given the geometry of the situation, it is easiest to work in cylindrical coor- 
dinates, in which case the cone is given by the cylindrical equation z = 2r. Thus, 
we have 


In p2 pa 
=| {| +D y.2aV = | 1 i r°(5 — z)rdzdr dé 
w o Jo Jz 


27 2 27 2 
=f [e EAEra h f 02r- 10 2)arao 
0 0 ai 0 0 


27 16 327 


2r 
_ 455 4 1,6) = 19 _ a a 
=f (3r Ol ae ed )| -» 40 Í 3 dé 5 j 


Recall that the center of mass of a solid object of total mass M is the point 
such that if all the mass M were concentrated there, the (first) moment would 
remain the same. An analogous idea may be defined in the context of moments of 
inertia. The radius of gyration of a solid with respect to an axis is the distance 
r from that axis that we should locate a point of mass M so that it has the same 
moment of inertia 7 (with respect to the axis) as the original solid does. More 
concisely, the radius of gyration r is defined by the equation 


EXAMPLE 10 We determine the radius of gyration with respect to the z-axis 
of the cone described in Example 9. Hence, we compute 
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From Example 9, I, = 3277/3. We determine the total mass M of the cone as 


follows: 


2n 2 4 On 2 
M -| / (5 = zyr dzdr do =f / (12 — 10r + 2r7)r dr d0 
0 0 2r 0 0 


=f” ((°- 


Thus, 


5.6 Exercises 


. The local grocery store receives a shipment of 75 cases 


of cat food every month. The inventory of cat food (i.e., 
the number of cases of cat food on hand as a function 
of days) is given by Z(x) = 75 cos(x x/15) + 80. 

(a) What is the average daily inventory over a month? 


(b) If the cost of storing a case is 2 cents per day, 
determine the average daily holding cost over the 
month. 


. Find the average value of f(x, y) = sin? x cos? y over 


R = [0, 27] x [0, 47]. 


. Find the average value of f(x, y) = e?*+» over the tri- 


angular region whose vertices are (0, 0), (1,0), and 
(0, 1). 


. Find the average value of g(x, y, z) = e* over the unit 


ball given by 
B = {(x, y, z) | x? +y +z” < 1}. 


. Suppose that the temperature at a point in the cube 


W = [-1, 1] x [-1, 1] x [-1, 1] 
varies in proportion to the square of the point’s distance 
from the origin. 
(a) What is the average temperature of the cube? 


(b) Describe the set of points in the cube where the 
temperature is equal to the average temperature. 


. Let D be the region between the square with ver- 


tices (1, 1), (—1, 1), (—1, —1), (1, —1) and the unit 
disk centered at the origin. Find the average value of 
f(x, y) =x? + y* on D. 


. Let W be the region in R? between the cube with 


vertices (1, 1, 1), (—1, 1, 1), (—1, —1, 1), (1, —1, 1), 
(1; 1-0); L L1); (-1,-1,-D, d.—-1, -1) 
and the unit ball centered at the origin. Find the av- 
erage value of f(x, y, z) = x? +y +z on W. 


10,3. 1,452 2 
d +AT = f 3 3 


327/3 
= =l. + 
T 327/3 


8. 


10. 


11. 


12. 


13. 


14. 


15. 


27 16 


32 
ar F EEN 


Suppose that you commute every day to work by sub- 
way. You walk to the same subway station, which is 
served by two subway lines, both stopping near where 
you work. During rush hour, each subway line sends 
trains to arrive at the stop every 6 minutes, but the dis- 
patchers begin the schedules at random times. What is 
the average time you expect to wait for a subway train? 
(Hint: Model the waiting time for the two subway lines 
by using a point (x, y) in the square [0, 6] x [0, 6].) 


. Repeat Exercise 8 in the case that the subway stop is 


serviced by three subway lines (each with trains arriv- 
ing every 6 minutes), rather than two. 


Find the center of mass of the region bounded by the 
parabola y = 8 — 2x” and the x-axis 

(a) if the density 5 is constant; 

(b) if the density 6 = 3y. 


Find the centroid of a semicircular plate. (Hint: Judi- 
cious use of a suitable coordinate system might help.) 


Find the center of mass of a plate that is shaped like the 
region between y = x? and y = 2x, where the density 
varies as 1 +x + y. 


Find the center of mass of a lamina shaped like the 
region 


(œ, y) l0 <y < Vx, 0 <x <9}, 


where the density varies as xy. 


Find the centroid of the region bounded by the cardioid 
given in polar coordinates by the equation r = 
1 — sin 0. (Hint: Think carefully.) 


Find the centroid of the lamina described in polar 
coordinates as 


{(r,0)|0<r <4cos0, 0 <60 < 7/3}. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


Find the center of mass of the lamina described in polar 
coordinates as 


{(r,0)|O0<r<3,0<0 < 7/4}, 


where the density of the lamina varies as ô(r, 0) = 
4-r. 


Find the center of mass of the region inside the car- 
dioid given in polar coordinates as r = 1 + cos 0, and 
whose density varies as ô(r, 0) = r. 


Find the centroid of the tetrahedron whose vertices are 
at (0, 0, 0), (1, 0, 0), (0, 2, 0), and (0, 0, 3). 


A solid is bounded below by z = 3y?, above by the 

plane z = 3, and on the ends by the planes x = —1 

and x = 2. 

(a) Find the centroid of this solid. 

(b) Now assume that the density of the solid is given 
by ô =z+x?. Find the center of mass of the 
solid. 


Determine the centroid of the region bounded above 
by the sphere x? + y? +z? = 18 and below by the 
paraboloid 3z = x? + y?. 


Find the centroid of the solid, capsule-shaped region 
bounded by the paraboloids z = 3x? + 3y? — 16 and 
z=9- x — y. 

Find the centroid of the “ice cream cone” shown in 
Figure 5.122. 


Sphere: x? + y?+z7=25 


Cone: z =2 Vx? + y? 
y 


x 


Figure 5.122 The ice cream cone solid 
of Exercise 22. 


Find the centroid of the solid shaped as one-eighth of 
a solid ball of radius a. (Hint: Model the solid as the 
first octant portion of a ball of radius a with center at 
the origin.) 


Find the center of mass of a solid cylindrical peg of 
radius a and height h whose mass density at a point 
in the peg varies as the square of the distance of that 
point from the top of the cylinder. 


25. 


26. 


27. 


28. 
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(a) Find the moment of inertia about the coordinate 
axes of a solid, homogeneous tetrahedron whose 
vertices are located at (0,0,0), (1, 0, 0), (0, 1, 0), 
and (0, 0, 1). 

(b) What are the radii of gyration about the coordinate 
axes? 


Consider the solid cube W = [0, 2] x [0, 2] x [0, 2]. 
Find the moments of inertia and the radii of gyration 
about the coordinate axes if the density of the cube is 
6(x,y,z)=xtytz+1. 


A solid is bounded by the paraboloid z = x? + y? and 
the plane z = 9. Find the moment of inertia and radius 
of gyration about the z-axis if 


(a) the density is d(x, y, z) = 2z; 

(b) the density is 5(x, y, z) = y x? + y?. 

Find the moment of inertia and radius of gyration about 
the z-axis of a solid ball of radius a, centered at the ori- 
gin, if 

(a) the density ô is constant; 

(b) 6@, y, z) =x? +y? +z’; 

(©) 6,9, = x° +y’. 


We can find the moment of inertia of a lamina in the plane 
with density 5(x, y) by considering the lamina to be a flat plate 
sitting in the xy-plane in R°. Then, for example, the distance 
of a point (x, y) in the lamina to the x-axis is given by |y], the 
distance to the y-axis is given by |x|, and the distance to the 
z-axis (or the origin) is given by \/x* + y?. (See Figure 5.123.) 
Using these ideas, find the specified moments of inertia of the 
laminas given in Exercises 29-31. 


29. 


30. 


\ 
a 
i j 


Figure 5.123 A lamina situated in 
the xy-plane in R°. 


The moment of inertia /, about the x-axis of the 
lamina that has the shape bounded by the graph of 
y =x? +2 and the line y = 3, and whose density 
varies as d(x, y) = x? + 1. 


The moment of inertia 7, about the z-axis of the lam- 
ina shaped as the rectangle [0, 2] x [0, 1], and whose 
density varies as d(x, y) = 1 + y. 
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31. The moment of inertia about the line y = 3 of the lam- to (0, 0,7). So, in particular, r is the distance from the point 
ina shaped as the disk mass m to the center of W. 


{(@, y) [9° +9" 54}, 
and whose density varies as 6(x, y) = x”. 
The gravitational field between a mass M concentrated at 
the point (x, y, z) and a mass m concentrated at the point 
(xo, Yo, Zo) is 
GMm[(x — xo)i + Y — yo)j + & — zo)k] 
[(x — x0)? + O — yo)? + E- zo Pe 
The gravitational potential V of F is 


GMm 
V(x = x0? + = yo + & = 20 
Figure 5.124 The spherical 
(We have seen in §3.3 that F = —VV.) Now suppose that, in- shell of Exercises 32-34. 
stead of a point mass M, we have a solid region W of density 
5(x, y, z) and total mass M. Then the gravitational potential of 32. Show that ifr > b, then V(0, 0, r) = —GMm/r. This 
W acting a the point mass m may be found by looking at “in- is exactly the same gravitational potential as if all the 
finitesimal” point masses dm = 8(x, y, z)d V and adding (via mass M of W were concentrated at the origin. This is a 
integration) their individual potentials. That is, the potential key result of Newtonian mechanics. (Hint: Use spher- 
of W is ical coordinates and integrate with respect to g before 
V(X0, Yo, Zo) integrating with respect to p.) 
33. Show that ifr < a, then there is no gravitational force. 
Gmô(x, y,z)dV ; reas : ; 
= ITH = Gy a =" (Hint: Show that V (0, 0, 7) is actually independent ofr. 
w J (x — x0? + O — yo + E — zo) Then relate the gravitational potential to gravitational 
In Exercises 32-34, Let W be the region between two con- force. As in Exercise 32, use spherical coordinates and 
centric spheres of radii a < b, centered at the origin. (See integrate with respect to ọ before integrating with re- 
Figure 5.124.) Assume that W has total mass M and constant spect to p.) 
density 5. The object of the following exercises is to compute 34. (a) Find V(0,0,r)ifa <r <b 


the gravitational potential V (xo, yo, zo) of W on a mass m 
concentrated at (xo, Yo, zo). Note that, by the spherical sym- 
metry, there is no loss of generality in taking (xo, Yo, Zo) equal 


(b) Relate your answer in part (a) to the results of Ex- 
ercises 32 and 33. 


57 Numerical Approximations of Multiple 
Integrals (optional) 


Suppose that f is a continuous function defined on some bounded region D in the 
plane. If D is arectangle or an elementary region, then Theorem 2.10 enables us to 
calculate ff f dA as an iterated integral. However, the “partial antiderivatives” 
in the iterated integral may turn out to be impossible to determine in practice. 
Nonetheless, an approximate value of ff p f dA may be entirely acceptable for 
some purposes. In this section, we discuss how we can adapt numerical meth- 
ods for approximating definite integrals of functions of a single variable to give 
techniques for approximating double integrals. 


Numerical Methods for Definite Integrals 
Let us review two familiar techniques for approximating the value of Í f(x) dx. 
The first of these is the trapezoidal rule. We begin by partitioning the interval 
[a, b] into n equal subintervals. Thus, we set 

b—a 


Ax = , a&a=X0<X1 <... < X =b, where x; = a +i ^x. 
n 


a=x0 xy Xii Xi Xp Xn =H 


Figure 5.125 When f is 
nonnegative on [a, b], the area 
under the graph of f is 
approximated by the sum of 
areas of the trapezoids shown. 
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The trapezoidal rule approximation T, is 


II 


T = S* OSH H 2S) + FOI 


n—1 
-2 [9 +2 rosso], (1) 
i=l 


It is obtained by approximating the function f with n linear functions that pass 
through the pairs of points (x;—1, f(%;-1)) and (x;, f(x;)) fori = 1, ..., n. Then 
the (net) area under the curve y = f(x) between x = a and x = b is approximated 
by the sum of the (net) areas under the graphs of the linear functions. When f 
is nonnegative, the approximating areas are those of trapezoids—thus, the name 
for the method. (See Figure 5.125.) 

The key theoretical result concerning the trapezoidal rule is given by the 
following. 


THEOREM 7.1 Given a function f that is integrable on [a, b], we have 


b 
i f(x)dx = Ta + En, 


where 
Ax n—1 
T, = as os fai) + rol : 
i=l 


and E,, denotes the error involved in using T, to approximate the value of the 
definite integral. In addition, if f is of class C? on [a, b], then there is some 
number ¢ in (a, b) such that 


(b - ay 


at": 


= b—a 2 pn nes 
En BEETA f ¢)= 


In particular, Theorem 7.1 shows that if f” is bounded, that is, if | f’(x)| < M 
for all x in [a, b], then |E,,| < (b — a) M /(12n?). This inequality is very useful 
in estimating the accuracy of the approximation. 


EXAMPLE 1 We approximate i sin (x?) dx using the trapezoidal rule with 
n = 4. Thus, we have 


1—0 
Ax = 2 = 025, 
*=-7 


so that, using (1), we have 


l 0.25 
| sin (x?) dx © Ty = a [sin 0 + 2 sin (0.25°) + 2 sin (0.5?) + 2 sin (0.75°) 
0 


+ sin 1] 


0.25 
tag [0 + 0.12492 + 0.49481 + 1.06661 + 0.84147] 


0.25 
= —$ [2.52780] = 0.31598. 
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Note that the second derivative of sin (x°) is 2 cos (x?) — 4x? sin (x°) so that, for 
0<x<1, 


|2 cos (x?) — 4x? sin (x7)| < 2| cos (x7)| + 4x7| sin (x| <2+4=6. 
Hence, using Theorem 7.1, 


(1-0/6 1 
——— = — = 0.03125. 

12.42 32 
Thus, the true value of the integral must be between 0.31598 — 0.03125 = 0.2847 
(rounding to four decimal places) and 0.31598 + 0.03125 = 0.3473. Of course, 
a more accurate approximation may be obtained by taking a finer partition (i.e., 
a larger value for n). + 


|E4| < 


Another numerical technique for approximating the value of J. i f(x)dx with 
which you may be familiar is Simpson’s rule. As with the trapezoidal rule, we 
partition the interval [a, b] into equal subintervals, only now we require that the 
number of subintervals be even, which we will write as 2n. Thus, we take 

b—a 
2n 
Then the Simpson’s rule approximation Szn is 


Ax = 


> A=X) < XxX <... < Xn = b, where x; =a+iAx. 


= = OEST EE ES EEE E, 


+4 Ff (X2n-1) + f) 


A n—1 n 
= = [ro +29 f(xi)+4 > fer) + ro ; (2) 
i=l i=l 


EXAMPLE 2 We approximate the value of IA sin (x?) dx of Example 1 using 
Simpson’s rule with n = 2 (i.e., four subintervals). As before, we have 
1—0 
Ax = —— = 0.25, 
4 


so (2) gives 


f 0.25 
l sin (x°) dx © S4 = a [sin 0 + 4 sin (0.25*) + 2 sin (0.5*) + 4 sin (0.757) 
0 


+sin 1] 
0.25 
=> [0 + 0.24984 + 0.49481 + 2.13321 + 0.84147] 
0.25 
aie [3.71933] = 0.30994. 
Note that this value is in the range predicted by the trapezoidal rule. In fact, it is 
a more accurate approximation to the value of the definite integral. + 


Simpson’s rule is obtained by approximating the function f with n quadratic 
functions that pass through triples of points (x2;—2, f (X2i—2)), (X2i-1, f X2i-1)) 
(x2;, f(%2;)) fori = 1, ..., n. As with the trapezoidal rule, we have the following 
summary result. 
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THEOREM 7.2 Given a function f that is integrable on [a, b], we have 


b 
J f(x)dx = Son + Eon, 
where 
Ax n—1 n 
Son = | f@+2) faxa) +4) fer + fO) |, 
i=l i=l 
and E>, denotes the error involved in using S2, to approximate the value of the 


definite integral. In addition, if f is of class C4 on [a, b], then there is some 
number ¢ in (a, b) such that 


_ b-a 4 (4) _ es 
Ean = ~~ (Ax) £90) = - Sa 


JOO. 
EXAMPLE 3 Consider f (x? + 3x?) dx. We compare the trapezoidal rule 


and Simpson’s rule approximations with 4 subintervals. We thus have Ax = 
(3 — 1)/4 = 0.5 and 


0.5 

Ty = > [4 + 2(10.125 + 20 + 34.375) + 54] = 46.75; 
0.5 

Sa = — [4 + 4(10.125) + 2(20) + 4(34.375) + 54] = 46. 


3 
Note that 


3 
[ (432) ax= (fot +2) = (427) — 4) <4, 
1 


so Simpson’s rule agrees with the exact answer. This should not be a surprise, 
since for f(x) = x? + 3x’, we have that f(x) is identically zero, which means 
that the error term E4 for Simpson’s rule must be zero. + 


Approximating Double Integrals over Rectangles 
Now let f be a function of two variables that is continuous on the rectangle 
R = [a, b] x [c, d] in R?. We adapt the previous ideas to provide methods for 
approximating the value of the double integral ffp f dA. 

Because we assume that f is continuous on R, Fubini’s theorem applies to 


give 
I fada = f f so. ydyae. 6) 


Next we partition the x-interval [a, b] into m equal subintervals. Thus, 


b-a 
Ax = >, AG=Xy < XxX, <...< Xm = bd, where x; = a + i Ax. 
m 


Similarly, we partition the y-interval [c, d] into n equal subintervals; hence, 


d—-c 
Ay= , C=y<y<...<Yn =d, where yj = c + jAy. 
n 


In the inner integral f 4 f(x, y)dy ofthe iterated integral in (3), the variable x is 
held constant. Therefore, we may approximate this integral using the trapezoidal 
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rule of Theorem 7.1. We obtain 
n-1 
[ fx, yay x È | 6 +2) fey) Se. o. 


Next, integrate each function of x appearing on the right side, so that 


f [ senavare 
Ay an f(a, nae 2 f fesas f° sadar, 


Now use the trapezoidal rule again on each integral appearing on the right. This 
means that, for j = 0,...,7, 


(4) 


m—1 
[ F(x, ydr w = ja +2) fæ y) + FO, va} (5) 


Putting (4) and (5) together, we obtain 


b d Ay m—l1 
| | senaar~ ZF] fa, )+2¥ fond + SOO 


=] 


n=l A m—1 
+250 = = yj) ta fxi, yj) + FO, 7 
j= = 


i=l 


m—1 
+o [re d)j+29  fnd)+ fO, o|}. 


Therefore, the trapezoidal rule approximation Tn,n to f f n fdAis 


Tan = 


m—l1 
Aray [re )+2¥ flat fOe) 


n—1 n—1m-1 


a DHEDHI fe y) © 


j=l i= 


m—1 
+ f(a,d)+2)° fOi.d)+ fO, D . 
i=l 


The expression appearing in (6) is not too memorable as it stands. However, 
we may interpret it as follows: 


Ax - n m 


dD wif, yj), 


j=0 i= 


Imn = 
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where 


1 if (aj, yj) is one of the four vertices of R; 
wij = 42 if(x;, y;) is a point on an edge of R, but not a vertex; 
4 if (xj, y;) is a point in the interior of R. 


EXAMPLE 4 We approximate the value of f$ fay +3x)dydx with T3 >. 
Thus, the x-interval [3, 6] is partitioned using Ax = (6 — 3)/3 = 1 and the y- 
interval [1, 2] is partitioned using Ay = (2 — 1)/2 = 0.5. See Figure 5.126 for 
the rectangle [3, 6] x [1, 2] with partition points marked. 

Hence, we have 


6 p2 
/ / (xy + 3x) dy dx © 
3 J1 


1(0.5 
Bo = ( dh 24+ 30+4 15 


4 
H 2(16 20497 90-495 4 13.5) + 4(18 + 22.5)] 


II 


1 
5 (486) = 60.75. 


Figure 5.126 The rectangle 

[3, 6] x [1, 2] of Example 4 with 
partition points marked with the 
values of f(x, y) = xy + 3x. 


For comparison, we calculate the iterated integral directly: 


6 p2 6 xy? 2 

/ J Gy +3x)dyax = Í (+) dx 

3 Ji 3 2 Ji 

6 6 

9 9 243 

= l Lxdx = ^x?| = — = 60.75. 
3 2 4 |, 
Note that T3 2 gives the exact answer in this case. + 


In the derivation above of the trapezoidal rule we assumed that the func- 
tion f was continuous. Nonetheless, we may use formula (6) to approximate 
Jfk f dA whenever f is integrable on R. However, in order to make estimates 
of the accuracy of trapezoidal approximations, we must assume more about f, as 
the following result (whose proof involves use of the intermediate value theorem 
and the mean value theorem for integrals) indicates. 
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THEOREM 7.3 (TRAPEZOIDAL RULE FOR DOUBLE INTEGRALS OVER RECTANGLES) 
Given a function f that is integrable on the rectangle R = [a, b] x [c, d], we have 


Ii f(x, y) dA = Tm,n + Emns 
R 


where Tm,n is given by (6) and Em,n denotes the error involved in using Tm,n to 
approximate the value of the double integral. Moreover, if f is of class C? on R, 
then there exist points (¢1, nı) and (¢2, 72) in R such that 


b= ald =o) 


Ena = 
; 12 


(Ax) fex(S1,m) + (Ay) foy (2; 02) - 


EXAMPLE 5 For f(x, y) = xy + 3x, we have that 3? f/dx? and ə? f/dy? are 
both identically zero. Hence, Theorem 7.3 implies that the trapezoidal rule ap- 
proximation is exact, as we already observed in Example 4. + 


In a manner entirely analogous to our derivation of the trapezoidal rule, we 
may also produce a Simpson’s rule approximation to the value of the double 
integral f f r f dA. As in the case of Simpson’s rule for approximating single- 
variable definite integrals, we partition both the x- and y-intervals into even 
numbers of equal subintervals. Thus, we take 


b—a 


2m 


Ax = 


, A=X) < XxX] <... < Xm = b, where x; = a + i Ax, 


Cc 
Ay = a C= yo < yi <... < ym =d, where yj = c + jAy. 
n 


The resulting Simpson’s rule approximation $22, is given by the expression 


Som, 2n = 


m—l1 m 
Ara [re +29 Son O +AY Soan) SOA 


n—l m—1 n—l m 
+25 ra WZ D Gnn ti LT 1+ Y2j) 
j=l i= j=l i= 
n—1 
+2% fob, yz) (7) 
j=l 
n m—l 


Hay sa y2j- D+8D 2 SOx y2- 1) 


j=l i= 
n m 


+16% >> faa 1, Yaj— D+4y fo, y2j-1) 


j=l i=1 


m—1 m 
+ f(a,d)+2 >> fan, d) +4 >> f@zż-1,d)+ fO, J ; 
i=l i=l 
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Just as in the case of the trapezoidal rule for double integrals over rectangles, the 
intermediate value theorem and the mean value theorem for integrals provide the 
following result. 


THEOREM 7.4 (SIMPSON’S RULE FOR DOUBLE INTEGRALS OVER RECTANGLES) 
Let f be a function that is integrable on the rectangle R = [a, b] x [c, d]. Then 


J fx, y) dA = Som,2n + Em,2n, 
R 


where Som, 2n 1S given by (7) and Ezm,2n denotes the error involved in using Som 2n 
to approximate the value of the double integral. Moreover, if f is of class C4 on 
R, then there exist points (¢1, nı) and (¢2, n2) in R such that 


b — a)(d — 
Eom 2n = -0- ono (AF fexxx(ti, m1) + (Ay) fiyyy (fa, m)] : 


EXAMPLE 6 We compare approximations to the value of i fo e dy dx 
using both the trapezoidal rule and Simpson’s rule with four subintervals in 
each of the x- and y-intervals. Thus, the x-interval [0, 0.5] is partitioned using 
Ax = (0.5 — 0)/4 = 0.125 and the y-interval [0, 1] is partitioned using Ay = 
(1 — 0)/4 = 0.25. 

The trapezoidal rule gives 


(0.125)(0.25) 
T44 = i [eo et 05 e!t? l eo dtl 2 (e015 4 et 
L p0t0.75 Jeh SrO L ghee uk g0375+0 i. 012571 Fel 


0.37541 | „0.5+0.25 | ,0.5+0.5 1 ,0.5+0.75 
+ PP Pt! +e + eto 4 e0307) 


4 (e-125+0.25 + e?125+0.5 4. e? 125+0.75 + e?25+0.25 + e??5t0.5 


0.25+0.75 0.375+0.25 0.375+0.5 0.375+0.75 
sia iain aie gto) 


— (14 4) = 1.121944. 
= í 3.608854) = 9 


Note that 3?/3x? (e*t) = 8?/3y? (e*t) = e**”. Theorem 7.3 says that there 
exist points (¢1, nı) and (¢2, 72) in the rectangle [0, 0.5] x [0, 1] such that 


(0.5 — 0)(1 


—0 
E44 = T ) [(0.125} e8 +™ + (0.25 e21]. 


On the rectangle [0, 0.5] x [0, 1], the a possible value of e*t is e°? = 1 
and the largest possible value is e°°+! = e!-5, Hence, we must have 


_ (0.5)(1) (0 DO 
12 


[(0.125) e3 + (0.25¥ e15] < E44 < — [(0.125)° + (0.25) ] 
or 


—0.0145888 < E4,4 < —0.003255. 
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Hence, the true value of the double integral lies between 1.121944 — 0.0145888 = 
1.10735 and 1.121944 — 0.003255 = 1.11869. 
On the other hand, Simpson’s rule gives 


Sigs 01230.23) Gi L e?+05 4 e!+0 4 e05+ 
+2 (e05 4. 02540 4 g025+1 pa) 


+ 4 (2e+?S + e0t0.75 + e? 125+0 Jet + e?25t0.5 


0.375+0 , 0.37541 | „0.5+0.25 , „0.5+0.75 
+e pertti g + e?) 


+8 (eens + g9:25+0.25 4 00:25+0.75 ae gees) 


0.125+0.25 .125+0.75 0.375+0.25 .375+0.75 
+ 16(e + +e? +0.75 1 90.375+ + 0374075) ] 


1 
= — (321.036910) = 1.114711. 
288 ) 


In this case, we note that 0*/ax* (e*t) = a4/dy4 (e*t) = e*t, so, as before, 
the minimum and maximum values of these partial derivatives on [0, 0.5] x [0, 1] 
are, respectively, 1 and e!°. Therefore, Theorem 7.4 implies that 


_ ©. eon) (0.5)) DW [ 


(0.125)*e! (025V ] < Eres — 


(0.125)* + (0.25)*] 
or 
—0.0000516688 < E44 < —0.0000115289. 


Hence, the true value of the double integral lies between 1.114711 — 
0.0000516688 = 1.11466 and 1.114711 — 0.0000115289 = 1.1147. 
For comparison, we may calculate the iterated integral exactly: 


0.5 pl 0.5 i 0.5 
| / et) dydx = | (e) o dx = if (et! — e”) dx 
o Jo 0 i 0 


= (et! — e”) 4 =e!5 — 05 _e +41 1.114686. 


Thus, we see that Simpson’s rule gives a highly accurate approximation with a 
very coarse partition. + 


Approximating Double Integrals over Elementary Regions 


We can modify the methods for approximating double integrals over rectangles 
to approximate double integrals over more general regions. Suppose first that D 
is an elementary region of type 1; that is, 


D = {(x, y) € R? | y(x) < y < 8(x), a < x < b}. 


Figure 5.127 The type 1 
elementary region D with 
partition points marked. 
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Then, if f is continuous on D, Theorem 2.10 tells us that 


b d(x) 
| faa= f f(x, y)dy dx. 
D a 


ya) 


To approximate this iterated integral with a version of, say, the trapezoidal rule, 
we need to partition the region D in a reasonable way. We do so as follows. First, 
we partition the x-interval [a, b] in the usual way: 


Ax = , A=X) < X] <... < Xm =b, where x; = a + i Ax. 


Now, for a fixed x in [a, b], we partition the corresponding y-interval [y (x), 5(x)] 
into n equal subintervals: 


d(x) — y@) 
7-3 4 


Ay(x) — 


v(x) = Yo < yı <... < Yn = (x), where yj(x) = y(x) + jAy(). 


Note that now Ay and the partition numbers yo, ..., y, are all functions of x. (See 
Figure 5.127.) Then, by applying the trapezoidal rule first to the inner integral 
and then the outer one, we obtain 


b pd(x) 
/ Fx, y)dy dx 
a dy 


(x) 


A n—1 
=f y(x SO fj, v(x) +2 5° fa, yj(x)) + fx, se) | 


j=l 


Fi, Yi) 


m—l1 
= S| SO fe, ya) 429) S n 


+O Fo, 7) 


j=l m—1 
nE | Ope, yl) + 2D Ara SAET 
j=l = 
b 
P Are) f(b, y; | 


f(a, (a va 


m—l1 
fae Ay(a) ave AY re. s(x) 


+ SO Fe, so]. 
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Figure 5.128 The partitioned 
region D of Example 7. 


. x 
2 2.1 2.2 


The trapezoidal rule approximation is, thus, 


n—1 
Ting = ZAO [re ya) +25 fla, ya) + fa, sw) 


j=! 


m-l A A i nol 
DD -a 2 (vO reaa O 


j=l 


+ 2f(%i, sa) 


4 


Aghati n—1 
i) ve. y(b)) +2 >> f(b, yj) + f, so) 


j=l 


EXAMPLE 7 We approximate J fie (x? + y’) dy dx by To 4. We have 


2.2—2 
= 5 =0.1; so 2 =2, 4% = 2.1, x= 2.2 


Ax 


and, therefore, that 


49 
so yo(xXo) = 2, yi(xo) = 2.5, y2(xo) = 3, y3(xo) = 3.5, y4(x0) = 4, 
4.2 —2.1 
Ay(2.1) = A = 0.525, 
so yo(x1) = 2.1, yix) = 2.625, yo(x1) = 3.15, y3(x1) = 3.675, 
ya(x1) = 4.2, 
4.4—2.2 


Ay(2.2) = ———_ 0.55, 


so yo(X2) = 2.2, yı(x2) = 2.75, yo(x2) = 3.3, y3(x2) = 3.85, 
ya(x2) = 4.4. 


(See Figure 5.128.) Thus, 


Th = ee?) [23 +22) +2 (22 +.2.57) + 3 +37) + (243.97) 
+(2°+4’)] 
! —— [2(2.13 + 2.17) + 4(2.13 + 2.6257) + 4(2.13 + 3.15?) 
+ 4(2.13 + 3.6752) + 2(2.1° + 4.2”)] 
| oe”) [(2.23 + 2.22) + 2(2.23 + 2.75%) + 2(2.23 + 3.32) 


+ 2(2.2° + 3.85°) + 2(2.2° + 4.4’)] 
= 0.0125(139) + 0.02625(156.7755) + 0.01375(175.934) = 8.271949375. 
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In this case, the exact answer is 


2.2 
= (t be)? = 028-29) + 4 22t—24 
= 8.23886. + 


If D is an elementary region of type 2, that is, 
D = {(x, y) E R? | a(y) <x < BY), c< y <d), 
then we may similarly approximate ff p f dA by first partitioning the y-interval 
[c, d] using 
c 
Ay = —, c=y0<y1 <...< Yn =d, where yj =c + jAy, 
n 


and then, for a fixed y in [c,d], by partitioning the corresponding x-interval 
[a(y), B(y)] into m equal subintervals: 


B(y) — ay) 


m 


Ax(y) = 


a(y) = x0 < xı <... < Xm = (y), where x;(y) = a(y) + i Ax(y). 


In doing so, we obtain a counterpart formula to that of (8), namely, 


A A m-—l1 
fy [reo 9+2 fa) + FBC). o| 
i=l 


n—1 A JA m—1 
pE op i [zreo y) +4) f0) 
i=1 


j=l 


+2f(B(y;); ») (9) 


Ax(d)A m1 
T anos [rea d)+2 Y f(xi(d),d) + f(BM), J ' 
i=l 


Of course, we may also adapt Simpson’s rule for use in the case of double 
integrals over elementary regions. Moreover, we may derive similar methods for 
approximating triple integrals as well. In practice, however, other methods are 
often used that lend themselves to computer implementation. 

One such alternative technique is known as the Monte Carlo method. It is 
based on a result called the mean value theorem for double integrals: If f is 
a continuous function of two variables and D is a bounded and connected (i.e., 
one-piece) region in R?, then there is a point P € D such that 


Tf f(x, y)dA = f(P) - area of D. 
D 
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5.7 Exercises 


In Exercises 1—6, (a) use the trapezoidal rule approximation 
T> 3 to estimate the values of the given integrals, and (b) com- 
pare your results with the exact answers. 


3.1 
c fa í x? — 6y’) dy dx 
1.5 


xy? dy dx 


2.2 p16 y 
3. / l —dydx 
2 1 y 


4. [fr dude 


e9 dy dx 


0.2 px/3 
6. f / x cos y dy dx 
0 m/6 
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Note that it follows that 
p 
area of D’ 
so that, by Definition 6.1, we have that f(P) = [f ]avg, the average (mean) value 
of f on D. The Monte Carlo method for approximating ff p f GA is to select n 


points Pı, ..., P, in D at random and compute the average value f of f on just 


these points: 
P 1 n 
f=- 9 SP). 
i=l 
Then f will approximate [ f Javg and, hence, 


Tf fdAw~ (area of D) f = ee fep), 
D 


If the area of D is in turn difficult to determine exactly, it, too, may be estimated 
by situating D inside a rectangle R and selecting m points at random in R. If r 
denotes the fraction of these points lying in D, then area of D ~ r - (area of R). 
In an analogous manner, we may give Monte Carlo approximations for triple 
integrals of functions of three variables defined over solid regions in space. 


“+2 dy dx 


1.1 0.6 
9. / | e 
1 0 


m/4 px /2 
10. f / sin 2x cos 3y dy dx 
0 


m/4 m/4 
11. i f sin (x + y)dy dx 
0 0 


Ll px/4 
12. / f e* cos y dy dx 
1 Jo 


13. In Chapter 7 we will see that the area of the portion of 
the graph of f(x, y) for (x, y) in a region D in R? is 

gfe +f Via 

(a) Set up an appropriate iterated integral to compute 
the surface area of the portion of the paraboloid 
z = 4 — x? —3y* where (x, y) € [0, 1] x [0, 1]. 

(b) Use the trapezoidal rule approximation 74,4 to es- 
timate the surface area. 


given by the double integral ffp 


In Exercises 7—12, (a) use the approximation S22 from Simp- 


sons rule to estimate the values of the given integrals, and (b) 14. 
compare your results with the exact answers. 


0.1 70.3 
7. f (y* — xy’) dy dx 
—0.1 JO 
0.1 2 1 
8. —— J] dyd 
| I Ge) = 


Concerning the iterated integral 

fr? fn x + y) dy dx: 

(a) Calculate the trapezoidal rule approximation T) 4. 

(b) Use Theorem 7.3 to estimate the error in your ap- 
proximation in part (a). 


(c) Calculate the Simpson’s rule approximation S) 4. 


15. 


16. 


17. 


18. 


19. 


20. 


True/False Exercises for Chapter 5 


1. 
2. 


(d) Use Theorem 7.4 to estimate the error in your ap- 
proximation in part (a). 


Without either evaluating or estimating the integral 
i 0.7 In(x )d d hich ‘ ti š 

i doe y)dy dx, which approximation is more 
accurate: T44 or $2,2? Explain your answer. 


Suppose that the trapezoidal rule is used to estimate the 
value of f Da e* +2y dy dx. Determine the small- 
est value of n so that the resulting approximation 7, , 
is accurate to within 1074 of the actual value of the 


integral. 


Consider a ta e*-Y dy dx. 

(a) If the trapezoidal rule approximation T, „ is used 
to estimate the value of this integral, what is 
the smallest value of n so that the resulting 
approximation is accurate to within 1075 of the 
actual value? 


(b) Ifthe Simpson’s rule approximation Szn,2n is used 
to estimate the value of this integral, what is the 
smallest value of n so that the resulting approx- 
imation is accurate to within 1075 of the actual 
value? 


Concerning the iterated integral fis i (3x + 
Sy) dy dx: 
(a) Calculate the trapezoidal rule approximation 7) >. 


(b) Compare your result in part (a) with the exact 
answer. 


(c) Use Theorem 7.3 to explain your results in parts 
(a) and (b). 

Consider the iterated integral f a i ? ¥353 dy dx: 

(a) Calculate the Simpson’s rule approximation S$) 9. 

(b) Compare your result in part (a) with the exact an- 
swet. 


(c) Use Theorem 7.4 to explain your results in parts 
(a) and (b). 


In Exercises 20-25, (a) use the approximation T; 3 from the 
trapezoidal rule to estimate the values of the given integrals. 


0 p2 
/ J (x? + 2y’) dy dx 
=l J -x 


Every rectangle in R? may be denoted [a, b] x [c, d]. 


If f is a continuous function and f(x, y)> 0 ona 
region D in R?, then the volume of the solid in R? 
under the graph of the surface z = f(x, y) and above 


the region D in the xy-plane is 1 f(x, y)dA. 
D 


21 


22 


23 


24 


25 


26. 


3 
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True/False Exercises for Chapter 5 


m/4 peosx 
i Í (2x cos y + sin? x) dy dx 
0 s 


inx 


0.3 p2x 
: | / (xy — x’) dy dx 
0 x 


dy dx 


m/3 sin x x 
` Í 0 Jl—y? 


i / | ` sinxdxd y (Note the order of integration.) 
1 Jo 


1.6 p2y 
; / ! In(xy)dx dy 
1 y 


In this problem, you will develop another way to think 

about the trapezoidal rule approximation given in equa- 

tion (6). 

(a) Let L denote a general linear function of two vari- 
ables, that is, L(x, y) = Ax + By + C, where A, 
B, and C are constants. Set R = [a, b] x [c, d]. 
Show that 


J / LdA = (area of R)(average of the values of 
7 L taken at the four vertices of R). 


(Note that this gives an exact expression for the 
double integral.) 


(b) Suppose that f is any function of two variables that 
is integrable on R. Show that the trapezoidal rule 
approximation 7;,; to [{, f dA is 


Tı ı = (area of R)(average of the values of f 
taken at the four vertices of R). 


(c) Now let Ax =(b—a)/m, Ay =(d-—c)/n, 
and, for i=1,...,m, j=1,...,n, let Rij = 
[xi-1, Xi] x [yj-1, yjl, where Xj=at+ iAx and 
yj = c + jAy. Then we have 


[| ra -EÈ ff ra 


Use Tı ı to approximate each integral ff ry FdA 
and sum the results to obtain the formula for Tn,n 
given by equation (6). 


1/2 p2 
: / J y? sin (x) dy dx 
0 —1 


2 p1/2 
= l / y? sin (r x°) dx dy. 
-1/0 
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2 x 2 py 
4. f | Sdydx = | f 3 dx dy. 
0 J0 0 0 


i 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


2 y 2 x 
f | fæ dxdy = f / f(x, y)dy dx for 
o Jo o Jo 


all continuous functions f. 
0< / / sin (x* + y*)dx dy < x, where D denotes 
D 


the unit disk {(x, y) | x? + y? < 1}. 


2 x x 2 
| f V8 ydydx = | f yx? + ydxdy. 
0 Jo o Jo 


1 3 1 3 
/ | xe dy dx -(/ xe dx) (/ e” ay) 
-1 J0 =] 0 


The region in R? bounded by the graphs of y = x? 
and y=./x is a type 3 elementary region in the 
plane. 


The region in R? bounded by the graphs of y = sin x, 
y =cosx,x = 7/4,andx = 57/4 isa type 2 elemen- 
tary region in the plane. 


The region in R? bounded by the graphs of y? — x? — 
z? = 1 and 9x? + 4z? = 36 is a type 3 elementary 
region in space. 


I ©? + 1) dx dy gives the area of the region D, 
oe D = {(x, y) | (x = 2} + 3y? < 5}. 
| (y sin (x°) + 3) dx dy = 3/2, where D is the tri- 
Le with vertices (—1, 0), (1, 0), (0, 1). 


I (xty+2)dV =0. 
[-2,2]x[-1,1]x[-3,3] 
II («+ 2dV =0. 
[-2,2]x[0,1]x[—1,1] 
Tf (y+2)dV =0. 
[-2,2]x[0,1]x[—1,1] 
5/2 p2 
[ m / sin yz dz dx dy = 0. 
-2 


T= s/1=x2 =y? s/1=x2 =y? 
[ Z oe (y —x°)dzdydx =0. 
J 1-x—y? x=? 


IfT(u, v) = 
age D = T(D*) ofthe unit square D* 
is 7 square units. 


IfT(u, v) = 
age D = T(D*) of the rectangle D* 
is 5 square units. 


(2u — v, u + 3v), then the area of the im- 
= [0, 1] x [0, 1] 


(v — u, 3u + 2v), then the area of the im- 
= [0, 3] x [0, 2] 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


5—2y 
[ l, e™™ cos(x — 3y) dx dy 
10 pu/2 
=) | —5e" cos v dv du. 
0 u—5 


If D is the disk {(x,y)|x? +y? <9}, 


Tf V9 — x? — y? dA = 367. 
D 


then 


2m p2 p4 
The iterated integral f | / dz dr d0 represents 
2r 


the volume enclosed by the cone of height 4 and 
radius 2. 


The iterated integral 


Lf OT [OO es VEER eaves 


is given by an equivalent integral in cylindrical coor- 
dinates as 


2n 2 JV9—r2 
f f f (r? + 2r)dzdr dé. 
0 0 0 


The iterated integral 


JV2=x? 4x =y? x?—y? 
~ | Vx? +y? +z? +5dzdydx 
x+y? 


is given by an equivalent integral in spherical coordi- 
nates as 


d m/4 p2 
f / fe p2 +5 sing dp dọ d0. 
0 0 0 


The average value of the function f(x, y) = x?°y 
over the semicircular region D = {(x, y)|O0<y< 


V/4 — x°} is given by 


1 m/2 p2 

4: 2 
=f f r“ sin@ cos’ 6 dr d 
T Jo 0 
1 m 2 

4; 2 
>f f r“ sinô cos’ 0 dr dé. 
T Jx/2J0 


The center of mass of a lamina represented by the tri- 
angle with vertices (2, 0), (0, 1), (0, —1), and whose 
density varies as 6(x, y) = (x? + 1) cos y, has coordi- 
nates given by 


or by 


B L Cae + x) cos y dy dx B 
X= 3 rC : =0. 
fie (e-2)/2 (x2 + 1)cos y dy dx 


Sg 


The centroid of a cone of radius a, height h, with axis 
the z-axis and vertex at (0, 0, A) is (0, 0, 3h). 


The center of mass of the solid cylinder of radius a, 
height h, with axis the z-axis whose density at any point 


varies as e”, where d is the square of the distance from 
the point to the z-axis is (0, 0, Z), where 


20 
-= J K " cre” dzdr do ziy 
I h K re” dzdr d0 : 


1. Let B be the ball of radius 3; that is, 


B = {[(x, y, z) | x? + y? +27 < 9}. 


Without resorting to any explicit calculation of an iter- 
ated integral, determine the value of the triple integral 
JIT 7 (23 + 2)dV by using geometry and symmetry 
considerations. 


. Let W denote half of the solid ball of radius 2; that is, 
W = {(x, y, z) | x? +y? +z? <4, z > 0}. 


Without resorting to explicit calculation of an iterated 
integral, determine the value of the triple integral 


Tf We +y—3)dV. 


(Hint: Use geometry and symmetry.) 


. Let W be the solid region in R? with x > 0 that is 

bounded by the three surfaces z = 9 — x7, z = 2x? + 

y?, and x = 0. 

(a) Set up, but do not evaluate, two different (but 
equivalent) iterated integrals that both give the 
value of [ffy 3dV. 


Q (b) Use a computer algebra system to find the value 


of ff fy 3 dV and to check for consistency in your 
answers in part (a). 


4. Suppose that f is continuous on the rectangle R = 


[a, b] x [c, d]. For (x, y) € (a, b) x (c, d), we define 
P a y 
Fe = f | fo! yay ax. 
Use Fubini’s theorem to show that 3? F/ðxðy = 


Ə? F/ðyəðx. This provides an alternative proof of the 
equality of mixed partials. (Hint: Write 


Fans) = f sayar 
where 
g’, y) = [ f’, y)dy’. 
Then ðF/ðx and 0?F/dydx may be calculated using 


the fundamental theorem of calculus. Then use Fubini’s 
theorem to find dF /dy and 07 F/dxdy.) 


30. 
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10. 


11. 
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The integral I p° sin 2g sing dp dg dé repre- 
w 


sents the moment of inertia about the z-axis of a solid 
W with density z, expressed in spherical coordinates. 


. Convert the following cylindrical integral to equivalent 


iterated integrals in (a) Cartesian coordinates and (b) 
spherical coordinates: 


EEL” 


Evaluate the easiest of the three iterated integrals. 


r dzdr dð. 


. The volume of a solid is given by the iterated integral 


/4y—y? y af 16—x2=y? af 16—x2=y? -y? 
[ f dz dx dy. 
—4/ 16—x?— y? 
(a) Sketch the solid and also describe it by giving equa- 
tions for the surfaces that form its boundary. 


(b) Express the volume as an iterated integral in cylin- 
drical coordinates. Determine the volume. 


- Calculate the volume of a cube having edge length a 


by integrating in cylindrical coordinates. (Hint: Put the 
center of the cube at the origin.) 


- Calculate the volume of a cube having edge length a 


by integrating in spherical coordinates. 


[foo (2a 


where D is the triangular region bounded by the coor- 
dinate axes and the line x + y = 1. 


. Determine 


Evaluate 
1—2y 5 
[ Í, y(x + 2yPre%*2) dx dy 


by making a suitable change of variables. 


Find the area enclosed by the ellipse E given by the 
equation 


in the following way: 

(a) First, write the area as the value of an appropriate 
iterated integral in Cartesian coordinates. Do not 
evaluate this integral. 

(b) Next, scale the variables by letting x = ax, 
y = by. To what region E* in the xy-plane does 
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12. 


13. 


14. 


15. 


16. 


17. 


18. 
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the ellipse E correspond? Rewrite the x y-integral 
in part (a) as an x y-integral. 

(c) Finally, use polar coordinates to transform the x y- 
integral and thereby show that the area inside the 
original ellipse is mab. 


This problem concerns the rotated ellipse E with equa- 
tion 13x? + 14xy + 10y? = 9. 


(a) Let u = 2x — y, v = x + y and rewrite the equa- 
tion for E in the form 


u? is v? i 
ae b’ 
where a and b are positive constants to be 


determined. 


(b) Use an appropriate change of variables and the re- 
sult of part (c) of Exercise 11 to find the area en- 
closed by E. 


Consider the ellipse FE with equation 
5x? +6xy + 5y? = 4. Let u = x — y, v = x + y and 
follow the steps of Exercise 12. 


Imitate the techniques of Exercise 11 to find the volume 
enclosed by the ellipsoid E given by the equation 


Xx 


y 
a 


xy 
dA, 
II, y? =x? 


where D is the region in the first quadrant bounded 
by the hyperbolas x? — y? = 1, x? — y? =4 and 
the ellipses x? /4 + y? = 1, x?/16 + y?/4 = 1. (Hint: 
Sketch the region D, and use it to make an appropriate 
change of variables.) 


II (x? + ye” dA, 
D 


where D is the region in the first quadrant bounded 
by the hyperbolas x? — y? = 1, x? — y? = 9, xy = 1, 
xy = 4. 


Evaluate 


Evaluate 


Evaluate 


1 
—— dA, 
i. aye 


where D is the region bounded by xy = 1, xy = 4, 
y=ly=2. 


(a) Generalizing the notions of double and triple inte- 
grals, develop a definition of the “quadruple inte- 
gral” [fff f dV ofa function f(x, y, z, w) over 


a four-dimensional region W in R*. 


(b) Use your definition in part (a) to calculate 


1 (x + 2y + 3z — 4w)d V, 
w 


where W is the four-dimensional box 


W = {(x,y,z,w)|0 <x <2, -l<y <3, 


0<z<4 —2<w<2}. 


19. (a) Setup, but do not evaluate, a quadruple iterated in- 
tegral that computes the four-dimensional volume 
of the four-dimensional ball of radius a: 


B = {(x, y, z, wW) | x? +y +z +w <a’}. 


ap (b) Use a computer algebra system to give a formula 
for the volume. 


<p (c) Use a computer algebra system to give for- 
mulas for the n-dimensional volume of the 
n-dimensional ball 


B = {(x1, X2,..-%m) | 47 +2 +--+ +22 < 07} 


in the cases where n = 5, 6. Is there any pattern to 
your answers? 


In Exercises 20-23, you will give a general expression for the 
n-dimensional volume of the n-dimensional ball 


B = {(x1, X2,- - -5 Xn) | X7 +39 Hrg Sa}. 


Let V,,(a) denote this n-dimensional volume. Let C,, denote the 
n-dimensional volume of the unit ball U; that is, C, = V,(1). 


20. By scaling the variables x1, x2,...,Xņn and using a 
change of variables formula analogous to those in The- 
orems 5.3 and 5.5, show that V,,(a) = C,a". 


21. (a) Consider points in B of the form (x1, x2, 0,..., 0). 
Show that the coordinates x1, x2 describe points 
(in RÊ) lying in the disk of radius a. 
(b) In R”, let the polar coordinates r and 0 replace 
the Cartesian coordinates x; and x2. Argue that 
the points in B lying over a particular point (r, 0) 
in the disk described in part (a) must fill out an 
(n — 2)-dimensional ball of radius va? — r?. 


(c) Use part (b) to show that the n-dimensional volume 
of B is given by 


20 a 
f | V,-2(v a2 — r?)r dr dé. 
0 0 


22. Use the previous two exercises to establish the recur- 
sive formula 


2na? 


V,(a) = ( ) V,-2(a). 


23. (a) Show that V|(a) = 2a and V2(a) = ma’. (These 
are familiar facts.) 


(b) Use part (a) and the previous problem to show that 


( tl? ) 
a" ifn is even 
| 
V,(a) = 2) 
Qnt+1)/2 z (n—1)/2 
(E) a” ifn is odd 
n!! 
where the double factorial n!! = n(n — 2)(n — 
4)---3-1 (.e., the product of all odd integers 
from 1 to n). 


24. A spherical shell with inner radius 3 cm and outer 


radius 4 cm has a mass density that varies as 

0.12d? g/cm? , where d denotes the distance (in cen- 

timeters) from a point in the shell to the center of the 
shell. 

(a) Determine the total mass of the shell. 

(b) Will the shell float in water? (Note: The density 
of water is 1 g/cm’. To answer this question, you 
need to determine the average density of the shell.) 

(c) Suppose that the shell has a small hole so that the 
core of the shell fills with water. Now will it float? 


. A dome is shaped as a hemisphere. If a pole whose 
length is the average height of the dome is to be in- 
stalled inside the dome in an upright position, where 
on the floor can it be located? 


. Let f be continuous on R = [a, b] x [c,d]. In this 
problem, you will establish Leibniz’s rule for “differ- 
entiating under the integral sign”: 


d b 


b 
L | fæ, ydx = / Deo 
dY Ja a 


(a) Let GO) = S? fŒ, y’)dx. For c < y < d, use 
the fundamental theorem of calculus to compute 
d/dy f? G(y")dy' and, therefore, 


d P fÈ 
£f | fy(x, y) dx dy’. 
y C a 


(b) Use Fubini’s theorem and part (a) to establish 
Leibniz’s rule. 


. The function f(x, y) = 1/,/xy is unbounded when ei- 
ther x or y is zero. Thus, if D = [0, 1] x [0, 1], we 


1 
say that I —— dA is an improper double inte- 
D N XY 
gral, analogous to the one-variable improper integral 
1 
1 
f — dx. Improper multiple integrals of this type 
o Vx 


may be evaluated using an appropriate limiting pro- 
cess. In this problem, you will determine the value of 


1 
—— dA in the following manner: 
JÍ, 7> 


28. 


29. 


30. 


31. 


32. 
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(a) For 0<e< 1/2, 0<ô< 1/2, let Des = 
fe, 1 — e€] x [ô, 1 — ô]. (Note that Des C D.) 
Calculate 


d= ff aaa 


(b) Evaluate lim  /(e, 45). You should obtain a fi- 
(€,6)—(0,0) 


nite value, which may be taken to be the value of 


1 
—— dA, since De 5 “fills out” D as (€, 5) > 
| [ VY i 


(0, 0). (We say that in this case the improper inte- 
gral converges.) 


Imitate the techniques of Exercise 27 to determine if 
the improper double integral 


1 
J f dA 
[0,1]x[0,1] ¥ + Y 


converges and, if it does, find its value. (Hint: You will 
need to determine lim„,—o+ u ln u.) 


Imitate the techniques of Exercise 27 to determine if 
the improper double integral 


X 
/ f x aA 
[0,1]x[0,1] Y 


converges and, if it does, find its value. 


Calculate ffp lnyx? + y?dA, where D is the unit 


disk x? + y? < 1. Note that the integrand is not de- 
fined at the origin, so this is an example of an improper 
double integral. Nonetheless, you can find its value by 
integrating over the annular region 


D: = {(x, y) |€ < x? +y? < 1} 


and taking appropriate limits. 


Find the value of the improper triple integral 


Ill Infx? + y2+22dV, 
B 


where B is the solid ball x? + y? + z? < 1. (See Exer- 
cise 30.) 


If D is an unbounded region in R?, the integral 
Jp f(x, y)dA is another type of improper double 
integral, analogous to one-variable improper integrals 


suchas [~ f(x)dx, ftn f(x) dx, or [%, f(x) dx. In 
this problem, you will determine the value of 


1 
A; 
I, x?y3 


where D = {(x, y)|x > 1, y > 1} using a limiting 
process. 
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33. 


34. 


35. 


36. 


37. 
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(a) Fora > 1,b > 1, let Da.» = [1, a] x [1, b]. Com- 


pute 
I( =| dA 
a, = Das x2y3 i 


(b) Evaluate lim Z(a, b). You should obtain a 
(a,b) (00,00) 
finite value, which may be taken to be the value 


1 
of I / —— dA. (In such a case, we say that the 
p X?y? 


improper integral converges.) 


Let D = {(x, y) | x > 1, y > 1}. For what values of p 
and q does the improper integral 


1 
dA 
[sm 


converge? For those values of p and q for which the 
integral converges, what is the value of the integral? 


This problem concerns the improper integral 


where p is a constant. 


(a) Determine if the integral converges when p = —2 
by integrating over the disk Da = {(x, y) | x? + 
y? < a’} and then letting a > oo. 


(b) Determine for what values of p the integral 
Sfp + x? + y?)P dA converges. What is the 
value of the integral when it converges? 


Determine if 


1 
dV 
His (l+x2 + y? + 22)3 


converges by integrating over the ball B, = {(x, y, z) | 
x? + y? +z? < a°} and then letting a > ov. 


[ly Re 


converges by determining its value. 


Show that 


In this problem, you will find the value of the one- 
variable improper integral f ae e`? dx by using two- 
variable improper integrals. 

(a) First argue that eS e dx converges. (Hint: 


Note that e < 1/x? for all x; compare 
integrals.) 


(b) Let 7 denote the value of {°° e~*’ dx. Show that 


2n oe 2 2 2 2 
P =f J e*I dxdy= Tf e™ > dA. 
E 65 R? 


(c) Let Da denote the disk x? + y? < a?. Evaluate 
S Maree 
JJe T dA; 


(d) Compute J? as limao JIo, en -Y dA. 
(e) Now find [Ù e=% dx. 


Exercises 38—46 involve the notion of probability densities. A 
probability density function of a single variable is any func- 
tion f(x) such that f(x) > Oforallx € R, and f% f@a=l. 
Given such a density function, the probability that a randomly 
selected number x falls between the values a and b is 


b 
Prob(a < x < b) =f f(x)dx. 


38. (a) Check that f(x) = e~?"! is a probability density 
function. 

(b) Egbert turns on the stove in a random manner to 
heat cooking oil to fry chicken. If the probability 
density of the temperature x of the oil is given by 
f(x) = 5e7 30, what is the probability that the 
oil has a temperature between 250°F and 350°F? 


A joint probability density function for two random variables 
x and y is a function f(x, y) such that 


(i) f(x, y) > 0 for all (x, y) € RÈ, and 

Gi) Sfere Fœ y)dA= fry. fu, FG, y) dx dy=1. 
If f is such a probability density and D is a region in R?, then 
the probability that a randomly chosen point (x, y) lies in D is 


Prob((x, y) € p= | ff f(x, y)dA. 


39. (a) Show that the function 


2x+y , 

—— if0<x<5,0<y<4 
fay 140 © SFS SyS 

0 otherwise 


is a joint probability density function. 
(b) Find the probability that x < 1 and y < 1. 
40. (a) Show that the function 


ye** ifx>0,y>0 


0 otherwise 


foun | 


is a joint probability density function. 
(b) What is the probability that x + y < 2? 


41. Ifa and b are fixed positive constants, what value of C 
will make the function f(x, y) = Ce~@*!~)! a joint 
probability density function? 


42. Let a and b be fixed nonnegative constants, not both 
zero. For what value of C is 


C(ax+by) if0<x<1,0<y<1 


fon | 


0 otherwise 


a joint probability density function? 


43. 


44. 


45. 


46. 


Let a and b be fixed positive constants and, for a given 
constant C, consider the function 


Cxy if0<x<a,0<y<b 


0 otherwise 


fo | 


(a) For what value of C is f a joint probability density 
function? 


(b) Using the value of C that you found in part (a), 
what is the probability that bx — ay > 0? 


The research team for Vertigo Amusement Park deter- 
mines that the length of time x (in minutes) a customer 
spends waiting to participate in the new Drown Town 
water ride, and the length of time y actually spent in the 
ride, are jointly distributed according to the probability 
density function 


a e™¥/50-y/5 ify > 0. y> 0 

250 © ux= Uys 
fx, y) = 
0 otherwise 
Find the probability that a customer spends at most 
an hour involved with the ride (both waiting and 
participating). 


Suppose that you randomly shoot arrows at a circular 
target so that the distribution of your arrows is given 
by the probability density function 


1 2.2 
fay = e, 
IU 


where x and y are measured in feet. In the center of 
the target, there is a bull’s-eye that measures 1 ft in 
diameter. (See Figure 5.129.) What is the probability 
of your hitting the bull’s-eye? 


y 


Figure 5.129 The circular 
target of Exercise 45. The 
shaded region is the 
bull’s-eye. 


If x is a random variable with probability density func- 
tion f(x) and y is a random variable with probability 
density function g(y), then we say that x and y are 
independent random variables if their joint density 
function is the product of their individual density func- 
tions, that is, if 


F(x, y) = fœ). 


47. 


48. 


49. 
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Suppose that an electrical circuit is designed with two 
identical components whose time x to failure (mea- 
sured in hours) is given by an exponential probability 
density function 


0 ifx <0 


1_,—x/2000 
3000 


rw=] 


ifx>0 


Assuming that the components fail independently, 
what is the probability that they both fail within 2000 
hours? 


Let J, denote the moment of inertia of a solid W (of 
density 5(x, y, z)) about the line L. (See formula (8) in 
§5.6.) Let L be the line parallel to L that passes through 
the center of mass of W. Then, if M denotes the mass 
of W and h the distance between L and L, prove the 
parallel axis theorem: 


I, — I; = Mh’. 


(Hint: Without loss of generality, you can arrange 
things so that L is the z-axis.) 


Let F bea function of one variable that is continuous on 

the interval [a, b]. Then the function f(x, y) = F(x) 

(i.e., the function F considered as a function of two 

variables) is continuous on the rectangle R = [a, b] x 

[c, d]. 

(a) Show that the trapezoidal rule approximation Tnn 
to Sfr f(x, y)dA is Tm, = (d — c)Tm, where Tm 
denotes the trapezoidal rule approximation to the 
definite integral f F(x) dx. 

(b) Similarly, show that the Simpson’s rule approxi- 
mation S2m,2n to te f(x, y) dA is S2m,2n = (d = 
c)S2m, Where S2m denotes the Simpson’s rule ap- 
proximation to the definite integral J F(x) dx. 


Suppose that f is a function of one variable that is 
continuous on [a, b], and g is another function of one 
variable that is continuous on [c, d]. Then, from Exer- 
cise 40 in §5.2, we know that 


— FR) aA 


(fro) (fs). 


Show that this same product property holds for 
the trapezoidal rule approximation. That is, if 
Tm, n denotes the trapezoidal rule approximation 
to Sras)xte.d) S(x)g(y) dA, then Tin,n = Tn(f)T(8), 
where T,,(f) denotes the trapezoidal rule approxima- 
tion to J? f(x)dx and T,(g) the trapezoidal rule ap- 


proximation to i g(y)dy. 


6.1 Scalar and Vector Line 
Integrals 


6.2 Green's Theorem 
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x(a) = As, = arclength 
x(t) of segment 


Figure 6.1 The sum 

ra J (x(t) As; approximates 
the total charge along an idealized 
wire described by the path x. 


Line Integrals 


6.1 Scalar and Vector Line Integrals 


In this section, we describe two methods of integrating along a curve in space and 
explore the meaning and significance of our constructions. The main definitions 
are stated first for parametrized paths. Ultimately, we show that the integrals 
defined are largely independent of the parametrization, that instead they reflect 
essentially only the geometry of the underlying curve. 


Scalar Line Integrals 


To begin, we find a way to integrate a function (a scalar field) along a path. 
Let x: [a, b] > R? be a path of class C!. Let f: X C R? — R be a continuous 
function whose domain X contains the image of x (so that the composite f(x(t)) 
is defined). As has been the case with every other integral, the scalar line integral 
is a limit of appropriate Riemann sums. 

Let 


A=th<t)<-:+-<t<:::<t,=) 


be a partition of [a, b]. Let t be an arbitrary point in the kth subinterval [t,_1, tk] 
of the partition. Then we consider the sum 


Yo FOP)AS. (1) 
k=1 


tk 


where As, = de. xol dt is the length of the kth segment of x (i.e., the 
portion of x defined for %_; < t < tk). If we think of the image of the path x 
as representing an idealized wire in space and /(x(1;)) as the electrical charge 
density of the wire at a “test point” x(t¥) in the kth segment, then the product 
J (x(t{)) As; approximates the charge contributed by the segment of curve, and 
the sum in (1) approximates the total charge of the wire. (See Figure 6.1.) To find 
the actual charge on the wire, it is reasonable to take a limit as the curve segments 
become smaller; that is, 


all As,—> 


Total charge = lim oe SARE) As, 
k=l 


= all R 2 FOC ASk, (2) 


since x is of class C!. 
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The mean value theorem for integrals! tells us that there is some number te* 
in [t,—1, tk] such that 


As, = | l [Ol| dt = (tk — t-d [e] = e] At. 


th-1 


(Here At, denotes fk — tk—-1.) Since tř is an arbitrary point in [t,_1, tk], we may 
take it to be equal to 1“. Therefore, by substituting for As; in equation (2), and 
letting tf equal t;*, we have 


Total charge = z imi a 3 fx") |x| At 


b 
= | FOA [xO || at. 


This last result prompts the following definition: 


DEFINITION 1.1 The scalar line integral of f along the C! path x is 


b 
i f(x) |x|] at. 


We denote this integral f, f ds. 


EXAMPLE 1 Let x:[0, 277] —> R? be the helix x(t) = (cost, sint, t) and let 
f(x, y, z) = xy +z. We compute 


27 
i fas= f f(x(t)) [xO dt. 


First, 


x(t) = (— sint, cost, 1), 
so that 


[xO] = vsin? t + cos? t +1 = V2. 
We also have 
fO) = cost sint + t = 5 sin 2t + t 


from the double-angle formula. Thus, 


27 2x 
fras] (}sin22 +1) Var = v3 f (4 sin 2r +t) dt 
x 0 0 


ee) (—{ cos 2t + it”) j = V2 (( 


Ale 
N 
S] 

N 
— 
— 
Ble 

© 
— 
Nee? 


= 2/2 n?. + 


l Recall that this theorem says that if F is a continuous function, then there is some number c with 
a < c < b such that J F(x)dx = (b — a)F (c). 
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(2,1) 


(4, =1) 


Figure 6.2 A piecewise C! 


path x. 


Given the discussion preceding the formal definition ofthe scalar line integral, 
it is both convenient and appropriate to view the notation f, f ds as suggesting 
that the line integral represents a sum of values of f along x times “infinitesimal” 
pieces of arclength of x. 

Definition 1.1 is made only for paths x in R? and functions f defined on 
domains in R°. Nonetheless, for arbitrary n, we may certainly use the definite 
integral 


b 
[ sown [xo ar. 


where x is a C! path in R” and f is an appropriate function of n variables. We 
call this definite integral the scalar line integral as well (and maintain the notation 
f f ds) and rely on the context to make clear the dimensionality of the situation. 
Also, if x is not of class C!, but only “piecewise C!” (meaning that x can be 
broken into a finite number of segments that are individually of class C!), then 
we may still define the scalar line integral f f ds by breaking it up in a suitable 
manner. A similar technique must be used if f(x(t)) is only piecewise continuous. 


EXAMPLE 2 Let f(x, y) = y — x and let x: [0, 3] + R? be the planar path 


s- [e9 if0<1<1 
UNS gi LI ig al eres 


Hence, x is piecewise C! (see Figure 6.2); the two path segments defined for ¢ in 
[0, 1] and for ¢ in [1, 3] are each of class C!. Thus, 


[ras= f fas+ f ras. 


where x(t) = (2t, t) for 0 < t < 1 and x2(t) = (t + 1,5 — 4t) fr 1 <t <3. 
Note that 


KOl = v5 an [4o] = V17. 


Consequently, 
1 
1 1 
/ fds= | FOA [XO || dt = / (t —2t)- J/5dt = “5a = d 
Xi 0 0 2 E 2 
Also, 
3 3 
/ fds= | f(x2(t)) [LO] dt = I (5 — 4t) — (t+ 1)V17 dt 
X2 1 1 
= V17 (4t — $e) = —12V77. 
Hence, 
fras=-Ż -v7 + 
Vector Line Integrals 


Now we see how to integrate a vector field along a path. Again, let x: [a, b] —> R” 
be of class C!. (n will be 2 or 3 in the examples that follow.) Let F be a vector 


7 (straight-line 


i, 


A 


Figure 6.3 The work done by F 
in moving a particle from A to B 
in a straight line is F+ As. 
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field defined on a subset X of R” such that X contains the image of x. Assume 
that F varies continuously along x. 


DEFINITION 1.2 The vector line integral of F along x: [a, b] — R”, de- 
noted f° F + ds, is 


F-a = f Fe xoar 


We caution you to be clear about notation. In the vector line integral f, F - ds, 
the differential term ds should be thought of as a vector quantity (namely, the 
“differential” of position along the path), whereas in the scalar line integral 
f f ds, the differential term ds is a scalar quantity (namely, the differential of 
arclength). 


EXAMPLE 3 LetF be the radial vector field on R? given by F = xi + yj + zk 
and let x: [0, 1] + R? be the path x(t) = (t, 317, 2t°). Then x'(t) = (1, 6t, 6t°), 
and so 


1 
fras] F(x(t))-x'(t)dt 
x 0 
1 
= f (ti + 3t j+ 2k). (i+ 6tj + 607k) dt 
0 


1 
= (t + 1847 + 12t°) dt = (412 + 344 4. 279)|0 = 7. 
0 


As with scalar line integrals, we may define f F - ds when x is only a piece- 
wise C! path by breaking up the integral in a suitable manner. 
Definition 1.2 is important for the following reason: 


Physical interpretation of vector line integrals. Consider F to be a force 
field in space. Then ie F - ds may be taken to represent the work done by F 
on a particle as the particle moves along the path x. 


To justify this interpretation, first recall that, if F is a constant vector field 
and x is a straight-line path, then the work done by F in moving a particle from 
one point along x to another is given by 


Work = F- As, 


where As is the displacement vector from the initial to the final position. (See 
Figure 6.3.) 

In general, the path x need not be straight and the force field F may not 
be constant along x. Nonetheless, along a short segment of path, x is nearly 
straight and F is roughly constant, by continuity. Partition [a, b] as usual (i.e., 
take a = tọ <- < tk <- <t =b) and focus on the kth segment. 
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F(x(¢,)) (See Figure 6.4.) Then 


Work done along kth segment ~ F(x(t¥)) + Axx. 


Since 

x(t + At) — x(t) _ Ax 

= lim —, 
At At>0 At 


we must have, for At = tk — 1 © O and &_1 < të < tk, that 
Figure 6.4 Approximating the 


work along a segment of the x(t) © X(t) — X(tk-1) = AX 
path x. k tk — th—1 At 


ae x(t) = lim 
X(t,_1) x(t) * alate 


Hence, 


Total work ~ X` F(x(tf))+ Axr © X FOM) +x) Ate. 
k=1 k=1 


Therefore, it makes sense to take the limit as all the At,’s tend to zero and define 
the total work by 


k= ii F(x(t)) -x (At 
Wor T D3 (x(1)) x At 


b 
= / F(x(t)) + x(t) dt 


= | Fas 


Other Interpretations and Formulations 

Suppose x: [a, b] > R” is a C! path with x(t) Æ 0 for a < t < b. Recall from 

§3.2 that we define the unit tangent vector T to x by normalizing the velocity: 
x'(t) 

xol 


We may insinuate this tangent vector into the vector line integral as follows: From 
Definition 1.2, we have 


T(t) = 


b 
frais=] F(x(t)) -x (t)dt. 


Thus, 


B b x(t) , 
[ra= | FEO) zoj Ol dt 


b 
= J ERC) TA) [LO] adt 


= [en ds. (3) 


Since the dot product F - T is a scalar quantity, we have written the original vector 
line integral as a scalar line integral. We also see that f, F -ds represents the 


h\\\ 1 


Figure 6.5 The vector line integral 
J, F -+ ds equals f (F - T) ds, the 
scalar line integral of the tangential 
component of F along the path. 
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(scalar) line integral of the tangential component of F along the path—that is, 
how much ofthe vector field the underlying curve actually “sees.” (See Figure 6.5.) 

An important interpretation of the vector line integral occurs when x is a 
closed path (i.e., when x(a) = x(b)). In this circumstance, the quantity f F-ds 
is called the circulation of F along x. To understand this idea better, suppose 
that F represents the velocity vector field of a fluid. Consider the amount of fluid 
moved tangentially along a small segment of the path x during a brief time interval 
At. (We use Tt to denote time here so as not to conflict with the use of t for the 
parameter variable of the path x.) Since F- T gives the tangential component of 
F, we have that 


Amount of fluid moved © (F(x(t))At + T(t)) As, (4) 


where As is the length of the segment of the closed path x. (See Figure 6.6.) 
Formula (4) is only approximate because the segment of the path need not be 


Figure 6.6 The amount of fluid 
transported tangentially along a segment 
of the closed path x is approximately 
(F(x(t))At + T(t)) As. 


completely straight (so that T(t) may not be a constant vector), and also because 
the vector field F need not be constant over the segment of the path. If we divide 
the term in (4) by At, then the average rate of transport along the segment 
during the time interval At is (F(x(t)) + T(t)) As. If we now partition the closed 
path x into finitely many such small segments and sum the contributions of the 
form (F(x(t))+T(t)) As for each segment, then let all the lengths As tend to 
zero, we find that the average rate of fluid moved, denoted AL/At, is given 
approximately by 


AL 


— & F-T)ds. 
Al K jäs 


Finally, if we let At — 0, we may define the (instantaneous) rate of fluid moved, 


dL/dt, to be 
L 
<= f@nas= f F-as, 
dt í Ñ 


which is what we have called the circulation. 
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en F EXAMPLE 4 Thecircle x? + y? = 9 may be parametrized by x = 3 cost, y = 
F=xi+ yj 3 sint, 0 < t < 27. Hence, a unit tangent vector is 
—3sinti+3costj —yi j 
T= De AT CENT P ae 
a xX2+y=9 V9 sin? t + 9 cos? t 3 
Now consider the radial vector field F = x i + y j on R?. At every point along the 
x circle we have 
F-T=(xit+yj)- (==) =0. 
Thus, F is always perpendicular to the curve. Therefore, 
Figure 6.7 At every point along 
the circle, F = x i + y j has no [Fas T [E-Das — [oas =0 


tangential component. to ‘ but : 
and, considering F as a force, no work is done. Considering F as a velocity field, 


the circulation of F along x is likewise zero. (See Figure 6.7.) 
On the other hand, if instead F = yi — 2x j, then 


at) O —y? — 2x? 


3 3 


This quantity will always be negative on the circle, so we expect the circulation 
to be negative. In particular, we have 


2 2 2 
[v-as= [Enas = [tas 


Using the parametrization given above, we have ||x’(t)|| = 3, so that 


24.95? 2x Qsin? t + 18 cos? t 
J-a =f .3dt 
x 3 0 3 


F-T=(yi-2xj)- ( 


20 20 
= -9 f (sin? t + 2cos*t)dt = -9 f (1 + cos? t) dt 
0 0 


20 
= -9 f (1+ 40 + cos 2t)) dt 
0 
* = 277. + 


Next, suppose that x(t) = (x(t), y(t), z(t), a < t < b, is a C! path and 
F(x, y, z) = M(x, y, Jit N(x, y, Dit Px, y, Ok 


is a continuous vector field. Then, from Definition 1.2 of the vector line integral, 


we have 
b 
[ras | (M(x, y, i+ N(x, y,z)j+ P(x, y,z)k) 


KHI y Oj +z AOk)dt 


b 
= { [M(x, y, Dx A) + N(x, y, Dy(t) + P(x, y, Dz(0)] dt. 


a 


Recall that the differentials of x, y, and z are 
dx = x'(t)dt, dy = y'(t)dt, dz = z'(t)dt. 
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Hence, 


[Pease | MG, y dx + NG, y, dy + PE, y, adz. 


The integral 
f Mdx+Nay+ Paz 


is a notational alternative to f F - ds. Indeed, the former is defined by the latter. 
The integral 


[Max+ way + Paz 


is commonly referred to as the differential form of the line integral. (In fact, the 
expression M dx + N dy + P dz is itself called a differential form.) It empha- 
sizes the component functions of the vector field F and arises regularly in applica- 
tions. Be sure to interpret M dx + N dy + P dz carefully: It should be evaluated 
using the parametric equations for x, y, and z that come from the path x. 


EXAMPLE 5 We compute 


fo Hz)dx +(x +z)dy + (x + y)dz, 


where x is the path x(t) = (t, t, #°) for0 <t <1. 
Along the path, we have x = t, y = t°, and z = £’ so that dx = dt, dy = 
2t dt, and dz = 3t? dt. Therefore, 


[otdd ++ ddy + E+ paz 
1 
= Í (7? + P)dt + (t+ PNR2tdt + (t+ t°)30? dt 
0 
i 1 
F G +4 +3 )dt = (C + + P) =3. n 
0 


Line Integrals Along Curves: 
The Effect of Reparametrization 


Since the unit tangent vector to a path depends on the geometry of the underlying 
curve and not on the particular parametrization, we might expect the line integral 
likewise to depend only on the image curve. We shall see precisely to what degree 
this observation is true generally for both vector and scalar line integrals. 

We begin with an example. Consider the following two paths in the plane: 


x:[0, 277] > R?, x(t) = (cost, sint) 
and 
y: [0,7] —> R?, y(t) = (cos 2t, sin2r). 
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Itis not difficult to see that both x and y trace out a circle once ina counterclockwise 
sense. In fact, if we let u(t) = 2t, then we see that y(t) = x(u(t)). That is, the path 
y is nothing more than the path x together with a change of variables, which 
suggests the following general definition: 


DEFINITION 1.3 Let x:[a, b] —> R” be a piecewise C! path. We say that 
another C! path y:[c,d]—> R” is a reparametrization of x if there is 
a one-one and onto function u: [c,d] — [a, b] of class C!, with inverse 
u`!: [a, b] > [c, d] that is also of class C!, such that y(t) = x(u(t)); that is, 
7 rou 


Reflecting on Definition 1.3 should convince you that any reparametrization 
of a path must have the same underlying image curve as the original path. 


EXAMPLE 6 The path 
x(f) = (14+ 27,2-1,3+51), O<1t<1, 


traces the line segment from the point (1, 2, 3) to the point (3, 1, 8). So does the 
path 


We 2 =F 345r), 0<t<1. 


We have that y is a reparametrization of x via the change of variable u(t) = t°. 
However, the path z:[—1, 1] —> R? given by 


z(t) = (1+ 207,2— 17,3 + 5t) 


is not a reparametrization of x. We have z(t) = x(u(t)), where u(t) = t°, but in 
this case u maps [—1, 1] onto [0, 1] in a way that is not one-one. 
On the other hand, 


w(t) = (3 — 2t,1+t,8— 5t) O0<+t<1, 


is a reparametrization of x. We have w(t) = x(1 — t), so the function u: [0, 1] > 
[0, 1] given by u(t) = 1 — t provides the change of variable for the reparametriza- 
tion. Geometrically, w traces the line segment between (1, 2, 3) and (3, 1, 8) in 
the opposite direction to that of x. + 


Ify: [c,d] — R” is a reparametrization of x: [a, b] —> R” via the change of 
variable u: [c,d] — [a, b], then, since u is one-one, onto, and continuous, we 
must have either 


(i) u(c) = a and u(d) = b, or 
(ii) u(c) = b and u(d) = a. 


In the first case, we say that y (or u) is orientation-preserving and, in the second 
case, that y (or u) is orientation-reversing. The idea is that if y is an orientation- 
preserving reparametrization, then y traces out the same image curve in the same 
direction that x does, and if y is orientation-reversing, it traces the image in the 
opposite direction. 


x(b) 


x(a) Xopp(D) 


Figure 6.8 A path and its 
opposite. 


Xopp(4) 
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EXAMPLE 7 Ifx: [a,b] — R” isany C! path, then we may define the opposite 
path Xop: [a, b] > R” by 


Xopp(t) = x(a +b — t). 


(See Figure 6.8.) That is, Xopp(t) = x(u(t)), where u: [a, b] —> [a, b] is given by 
u(t) = a + b — t. Clearly, then, Xopp is an orientation-reversing reparametrization 
of x. + 


In addition to reversing orientation, a reparametrization of a path can change 
the speed. This follows readily from the chain rule: If y = x o u, then 


d 
yo = To= x'(u(t)) u(t). (4) 


So the velocity vector of the reparametrization y is just a scalar multiple (namely, 
u'(t)) of the velocity vector of x. In particular, we have 


Speed of y = |y O| = [O xL uW) | 


= [w (O)| |x uA] = [w A)| - (speed of x). (5) 


Since u is one-one, it follows that either u/(t) > 0 for all t € [a, b] or u'(t) < 0 
for all t € [a, b]. The first case occurs precisely when y is orientation-preserving 
and the second when y is orientation-reversing. 

How does the line integral of a function or a vector field along a path differ 
from the line integral (of the same function or vector field) along a reparametriza- 
tion of a path? Not much at all. The precise results are stated in Theorems 1.4 
and 1.5. 


THEOREM 1.4 Let x:[a,b] > R” be a piecewise C! path and let f:X C 
R” — R be a continuous function whose domain X contains the image of x. 
If y: [c, d] —> R” is any reparametrization of x, then 


[vase [ ras 


PROOF We will explicitly prove the result in the case where x (and, therefore, y) 
is of class C!. (When x is only piecewise C!, we can always break up the integral 
appropriately.) In the C! case, we have, by Definition 1.1 and the observations in 
equation (5), that 


d d 
/ TE / Fly) ly] at = | Fex(u(t))) [KUD] O| dt. 
y c c 


Ify is orientation-preserving, then u(c) = a, u(d) = b, and lw O)| = u'(t). Thus, 
using substitution of variables, 


d d 
J FCU |x @@)| wola= f f(x(u(t))) |U | wA at 


b 
=i f(x(u)) xa du= | fas, 
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If, on the other hand, y is orientation-reversing, then u(c) = b, u(d) = a, and 
|u'(t)| = —u'(t), since u'(t) < 0. Therefore, in this case, 


d d 
/ F(x(u(t))) [LUO] |W] at = / FOUE) [LUE wO) dt 
e | FU) |x| du 


b 
z f(x(u)) |x| du= | fas, 


Hence, iL f ds = f, f ds in either case, as desired. m 


THEOREM 1.5 Letx:[a, b] > R” beapiecewise C! pathandletF: X C R” > 
R” be a continuous vector field whose domain X contains the image of x. Let 
y: [c, d] —> R” be any reparametrization of x. Then 


1. If y is orientation-preserving, then J F-ds= f F-ds. 
2. Ify is orientation-reversing, then Íy F-ds = —f F- ds. 


PROOF As in the proof of Theorem 1.4, we consider only the C! case in detail. 
Using Definition 1.2 for the vector line integral, equation (4), and substitution of 
variables, we have 


d d 
/ F-ds = J FOO) -y (©dt = 1 F(x(u(t))) -x (UE) u'®) dt. 
y 


c c 


If y is orientation-preserving, then u(c) = a, u(d) = b, so we have 


d b 
J PRUO- XUDOd = | FEU- x du = | F-ds 


c a x 


This proves part 1. If y is orientation-reversing, then u(c) = b, u(d) = a, so, 
instead, we have 


d a 
f FEUD XUDO di = f FED- x du =- f Fas, 
b 


c x 
which establishes part 2. a 


Simply put, Theorems 1.4 and 1.5 tell us that scalar line integrals are entirely 
independent of the way we might choose to reparametrize a path. Vector line 
integrals are independent of reparametrization up to a sign that depends only on 
whether the reparametrization preserves or reverses orientation. 


EXAMPLE 8 LetF = xi+ yj, and consider the following three paths between 
(0, 0) and (1, 1): 


x(t) = (t,t) <1 = 1, 

y(t) = (2t, 2t) O<t<5, 
and 

at) = (1 —-t,1—f) 0<r<l 
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The three paths are all reparametrizations of one another; x, y, and z all trace the 
line segment between (0, 0) and (1, 1)—x and y from (0, 0) to (1, 1) and z from 
(1, 1) to (0, 0). 

We can compare the values of the line integrals of F along these paths: 


i F-ds= [ F(x(t)) -x (t) dt 


1 
= Ci+tp-G+par= | 2tdt = |) = 1; 
0 0 


/2 1/2 
fras] Qri+2rj)-Qi+ apar= f 8t dt = 417|,"° = 1; 
y 0 0 


1 
[P-as= [ca Hi+(1—dj)-(-i-pat 
1 
ai 2t — 1)dt = (t — 1P = —1. 
0 


The results of these calculations are just what Theorem 1.5 predicts, since y is 
an orientation-preserving reparametrization of x and z is an orientation-reversing 
one. + 


The significance of Theorems 1.4 and 1.5 is not merely that they allow us 
occasionally to predict the results of line integral computations but also that they 
enable us to define line integrals over curves rather than over parametrized paths. 
To be more explicit, we say that a piecewise C! path x: [a, b] > R” is closed 
if x(a) = x(b). We say that the path x is simple if it has no self-intersections, 
that is, if x is one-one on [a, b], except possibly that x(a) may equal x(b). Then, 
by a curve C, we now mean the image of a path x: [a, b] —> R” that is one- 
one except possibly at finitely many points of [a, b]; the (nearly) one-one path x 
will be called a parametrization of C. Additionally, we will refer to the curve 
C as being closed or simple if it has a corresponding parametrization that is 
closed or simple. (See Figure 6.9.) It is a fact (whose proof we omit) that if x 
and y are both parametrizations of the same simple curve C, then they must be 
reparametrizations of each other. 


Ww Dg AA 


Not simple, not closed Simple, not closed 
Not simple, closed Simple, closed 


Figure 6.9 Examples of curves. 
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y 
(0, 3) 


(5,0) 


Figure 6.10 The ellipse 

x?/25 + y?/9 = 1 of Example 9 
is parametrized in the counter- 
clockwise sense by x(t) = 
(Scost, 3sint),0 < t < 27, and 
in the clockwise sense by y(t) = 
(5 cos 2(z — t), 3 sin2(z — t)), 
O<t<nz. 


Q 
4 es, 
or or 
Q 
Tad Co 
Figure 6.11 The possible 


orientations for a nonclosed and 
a closed, simple curve. 


EXAMPLE 9 The ellipse x?/25 + y?/9 = 1 shown in Figure 6.10 is a simple, 
closed curve that may be parametrized by either 


x: [0,27] > R?, x(t) = (5 cost, 3 sint) 
or by 
y:[0, x] > R?, y(t) = (5 cos 2(x — t), 3 sin2(x — t)), 


since both paths have the ellipse as image and each map is one-one, except at the 
endpoints of their respective domain intervals. Note that y is a reparametrization 
of x. 

However, the path 


z:[0, 6r] > R?, 2(t) = (Scost, 3 sint) 


is not a parametrization of the ellipse, since it traces the ellipse three times as t 
increases from 0 to 67. That is, z is not one-one on (0, 677). + 


Simple curves, whether closed or not, always have two orientations that cor- 
respond to the two possible directions of travel along any parametrizing path. (See 
Figure 6.11.) We say that a simple curve C is oriented if a choice of orientation 
is made. 

The reason for the preceding terminology is that if C is a (piecewise C!) 
simple curve, we may unambiguously define the scalar line integral ofa continuous 


function f over C by 
f tas = fi ds, 
C x 


where x is any parametrization of C. Theorem 1.4 guarantees that the choice of 
how to parametrize C will not matter. 

On the other hand, we can define the vector line integral only for oriented 
simple curves. If an orientation for C is chosen and x: [a, b] —> R” is a parame- 
trization of C that is consistent with this orientation, then we define the vector 
line integral of a continuous vector field F over C by 


[v-as= [reas 
G x 


Theorem 1.5 shows that this definition is independent of the choice of para- 
metrization of C, as long as it is made consistently with the given orienta- 
tion. Indeed, if C~ denotes the same curve as C, only oriented in the opposite 
way, then C~ is parametrized by Xopp (where x parametrizes C) and we have, 


by Theorem 1.5, 
/ Feds = | Feds =~ | Peds 


opp 


=- f Fas 
Č 


EXAMPLE 10 Let C be the upper semicircle of radius 2, centered at (0, 0) 
and oriented counterclockwise from (2, 0) to (—2, 0). Then we may calculate 
Je@? — y? + 1)ds by choosing any parametrization for C. For instance, we may 
parametrize C by 


x(t) = (2cost,2sint), O<t<z 


y 
(2, 12, 0) 


x 


Figure 6.12 The oriented curve 
of Example 11. 
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or by 
y(t) = (—2 cos 2t, —2 sin 2t), —m/2<t<0. 
(Note that y(t) = x(2t + 7 ).) Then 


[o@-y+nas= [@ -7+ Das 
C x 


=i (4cos? t — 4 sin? t + 1)V4sin? t + 4cos?t dt 
0 


= I (4cos2t + 1)2dt = 2 (2 sin2t + t)|} = 27, 
0 
by the double-angle formula. Similarly, we check 


[ee-y+nas= [e-+ nas 
c y 


0 
=i (4 cos’ 2t — 4 sin? 2t + 1)V 16 sin? 2t + 16 cos? 2t dt 
=r /2 


0 
z] (4cos4t + 1) - 4dt = 4(sin4t + 1)|° „p = 27. 
—r/2 


EXAMPLE 11 We calculate the work done by the force 


F = xi — yj + œx + y + z)k 


on a particle that moves along the parabola y = 3x?, z = 0 from the origin to the 
point (2, 12, 0). (See Figure 6.12.) 

We parametrize the parabola by x = t, y = 3t?, z = 0 for 0 < t < 2. Then, 
by Definition 1.2, 


Work = f F-as= [Pe-as= f Rowy-xoar 


2 2 
al 0-3,1 +38- 6,0)dr = | (t — 184°) dt 
0 0 


= (47? — 344)" — 2-72 = -70. 


The meaning of the negative sign is that by moving along the curve in the indicated 
direction, work is done against the force. If we orient the curve the opposite 
way, however, then the work done in moving from (2, 12, 0) to (0, 0, 0) would 
be 70. + 


Numerical Evaluation of Line Integrals (optional) 


If we have a scalar line integral f c f ds, or a vector line integral f cF-ds, anda 
suitable parametrization x of C, then Definitions 1.1 and 1.2 enable the evaluation 
of the line integrals by means of definite integrals in the parameter variable. These 
definite integrals may be difficult—or even impossible—to evaluate exactly, so 
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we might need to resort to a numerical method (such as the trapezoidal rule or 
Simpson’s rule) to approximate the value of the integral. 


Figure 6.13 The curve C with n 
points chosen on it. 


If a (vector) line integral is given in differential form as f cMdx+Ndy+ 
P dz (oras f, M dx + N dy if we are working in R?), then we can give numerical 
approximations without resorting to any parametrization as follows. First, choose 
points x9, X,,...X, along C, where Xo is the initial point and x, is the terminal 
point. (See Figure 6.13.) Fork = 0,...,7, let us write 


Xk = (Xk, Yk, Zk) 
and, fork = 1,...,n, let 
AIk = Xk — Xe-1, AV = Yk — Ye-1, AZ = Zk — Zk-1- 


Finally, let x; = (xý, yg. z%) denote any point on the arc of C between x,_ and 
x;. Then we may approximate the line integral as 


J M dx + N dy + Pdz% Y [M Axr + NOJ) Ayr + PORDAx]. ©) 
E k=1 


Besides the approximation given in (6), we may also use a version of the 
trapezoidal rule adapted to the case of line integrals. In particular, the trapezoidal 
rule approximation T, to the line integral f, cM dx +N dy + P dzis 


n 


T, = 2 [owo + M(xx)) as + (N (x1) + N(x) a 


+ (P (xe-1) + P&)) fa 


n Axı n Av, 
Ma) + MEDEE +Y WE) + NCR) E 
k=1 k=1 
n A 
+ Pant POL). (7) 
k=1 
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Similar (and shorter) formulas, analogous to (6) and (7), may be given to approx- 
imate the two-dimensional line integral fi c M dx + N dy; we will not state them 
explicitly. 


EXAMPLE 12 Let C be the ellipse x? + 4y? = 4, oriented counterclockwise. 
(See Figure 6.14.) We approximate f c y? dx + x dy using the (two-dimensional 
version of) the trapezoidal rule (7). 


Figure 6.14 The ellipse C in 
Example 12. 


To make this approximation, let 
Xo = (2, 0), xı = (0, 1), x2 = (—2, 0), x3 = (0, —1), x4 = (2, 0). 


Then we have 
Axı = Ax = —2, Ax3 = Axg = 2; 


Ay = i Ay2 = Ay3 = —1, Ay4 = Í; 
Hence, from (7), we compute 


T4 = (0 + 1DF +0 +0 + OHO HA H0 


+ (2+ 0)5 + (0+ (2) + ((—2) + OS + (0 + 2)4 
=—1-14+14141414141=4. 


The exact answer may be found using the parametrization 


0<t<2z. 


x =2cost 
y=sint 


We calculate that 


27 
[ > ax+xay =j (sin? t(—2sint) + 2 cos? t) dt 
Cc 0 


20 
= I (21 — cos? t)(— sin t) + 1 + cos 2t) dt 
0 


= (2cost — Scosst+t+ 1 sin 24) |?" 


= 27. 
From this we see that our approximation is quite rough. It can be improved 
by increasing n while taking smaller values of Ax, and Ay,;. For instance (see 
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Figure 6.15 The ellipse C in Example 
12 with eight points marked. 


Figure 6.15), if we let 
xo = 2,0), xı = (v3, $), x2 = 0, D, xs = (-V3,4), 4 =(-2,0), 


x; = (-v3,-3). v=ü0 -i aS (v3,-3). xs = (2, 0), 
then 
Ax; = V3 —2, Axo = Ax = —V3, Axg = V3 — 2, Axs; =2— V3, 
Axs = Ax, = V3, Axg = 2 — V3; 


Ayı = Ay: = a Ay; = Ay = Ays = Ayo = —}, Ay7 = Ays = 5. 
2 


Therefore, f € y? dx + x dy is also approximated by 


Ts = (0 + G) a + ((4) +1) — (e+) a 
((4)’ | o?) = ! (o ( e 
(ar eer) a 
(( 1)? JE (24 we ele yl? 
Onn WoL HEV +D 
(2) + ( WoL + (-V3) 4 no ~ 
(04 ve k (v3 4 g 


= 2 + 2V3 ~ 5.4641. 


Although still rough, this represents a better approximation. 
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EXAMPLE 13 Let C be the line segment from (0, 0, 0) to (2, 2, 2), and con- 


sider the line integral f cyvlt x3 dx +e dy + cos (2?) dz. The standard 
parametrization of the line segment C by 


0<t<2 


II 
= 


z= 


enables us, in theory, to evaluate the line integral above by evaluating the 
one-variable integral 


2 
f (rv 1+2+¢ + cos 0) dt. 
0 


Unfortunately, we cannot do so exactly (1.e., by means of the fundamental 
theorem of calculus, since an antiderivative of the integrand cannot be found 
in terms of elementary functions). Instead, we provide approximations using 
formulas (6) and (7). 

Let n = 4. If we choose points along C that are regularly spaced, then we 
have 
xo = (0,0,0), xı = (33 3); X2 = (1,1, 1), x; = (3, 3, 3), X4 = (2, 2, 2), 
so that 

Axx = Ayk = AZk = L, 
To calculate an approximation using formula (6), we can, for example, let 
xý = 4&1 + X), 

which is the midpoint of the line segment joining x,_; and x4. Then 


= (74) = (57-9) 6 = (579). = (79), 


and so formula (6) tells us that 


i y V14x3dx +e dy + cos (z?) dz 
G 


~|1 1)31 —1/16 1 i] 
~|; DP i $+ 008 454] 


3 3)31 4 -9/161 Dl 
E 1 og € 7 + COS 7 +5 


5 5\31 —25/161 25 1 
E G) zte z +cos 75° 3 


Less 


7 7)31 4 o—49/161 4 ogc 49. 1 
[3V as e 2 T COS 76 J 


~ 5.16422. 
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Using the trapezoidal rule formula (7), we may also approximate the line inte- 
gral by 


T,= (0+ TF W) We 4 (; 1+(!) + ITET) 12 
+ (wis P+ lie 3’) ESN (yi +(3) +2vTF) 12 


+ (2° $e aa aTa 4 (el 4 e HAIZ 


(e74 + e 4) 42 + (cos0 + cos 1)? + (cos } + cos 1) 


(cos 1 + cos ye (cos 3 cos 4) 12 
~ 5.44874. 


As was the case in Example 12, because of the small number of sampling 
points used, formulas (6) and (7) provide relatively crude approximations. + 


6.1 Exercises 


1. Let f(x, y) = x + 2y. Evaluate the scalar line integral In Exercises 8-16, find f F «ds, where the vector field F and 


J, f ds over the given path x. the path x are given. 
(a) x(t) = (2 — 31, 4t —1),0<t <2 8. F=xi+yj+zk, x()=(r+1,t,3¢—1), 0< 
(b) x(t) = (cost, sint), 0 <t <x t<1 
In Exercises 2-7, calculate f fds, where f and x are as 9. F=(y+2)i+xj, x(t) = (sint, — cost), 0<t< 
indicated. x/2 
2. f(x, y, z) = xyz, X(t) = (t, 2t, 3t), 0 <t <2 10. F=xi+ yj, x(t) = Qt+1,t+2)0<t<1 
3. f(x, y,z)= zE x(t) = (17°), let23 11. F=(y—x)itsyjx@=(@,0),-lstsl 
z 
í 12. F=xi+xyj+xyzk, x(t) = (3 cost, 2sint, 5t), 
4. f(x, y,z) = 3x + xy +z, x(t) = (cos 4t, sin 4t, 3t), 0<t<2x 
0<t<2r 
: 13. F = —3yi+xj+3z?k, x(t) = (2t +1,t? +t, e'), 
5. f(x, y, z) = ———, x(t) = (e” cos 3t, e” sin 3t, e”), O<t<l 
x24 y? 
0<1t<5 14. F=xi+yj—zk,x(t) = (t, 377, 20), -1<t<1 
6. f(x,y,z) =x+y+z, 15. F = 3zi + y? j+ 6zk, x(t) = (cost, sint, t/3), 0 < 
(2t, 0, 0) if0<t<1 t<40 
x(t) = į (2,3t—3,0) ifl<r<2 16. F = ycoszi+ x sinz j + xy sinz? k, x(t) = (t, t, 
(2,3,2t—4) if2<1t<3 P),0<t<l 
7: {Qa lt = ye ee, 17. Determine the value of f, x dy — y dx, where x(t) = 


ine (t,t, 0) if0<t<1 (cos 3t, sin3t), 0 <t <x. 
A= Tie ay 4th eee 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


Find the work done by the force field F = 2xi+j 
when a particle moves along the path x(t) = (t, 3t?, 2), 
0<t<2. 


If x= (e” cos3t, e” sin3t), 
xdx + ydy 
x (x? $ PpP 
Let C be the portion of the curve y = 2,/x between 
(1, 2) and (9, 6). Find f, 3y ds. 


0<t<2r, find 


Let F = (x? + y)i + (y — x)j and consider the two 
paths 


x(t) = (t,t), O<t<1 and 


y(t) = (1 — 20, 40? —4t + 1), 0<t< 


(a) Calculate f, F-ds and f, F ds. 


(b) By considering the image curves of the paths x and 
y, discuss your answers in part (a). 


Find the work done by the force field F = x? yi + zj + 
(2x — y)k ona particle as the particle moves along a 
straight line from (1, 1, 1) to (2, —3, 3). 


Let F = (22° — 3xy)i— x? j + x?z k. Calculate the 
line integral of F around the perimeter of the 
square with vertices (1, 1, 3), (—1, 1, 3), (—1, —1, 3), 
(1, —1, 3), oriented counterclockwise about the z-axis. 


Evaluate [(.(x? — y)dx + (x — y’) dy, where C is the 
line segment from (1, 1) to (3, 5). 


Find fo xy dx — (x + y) dy, where C is the trapezoid 
with vertices (0, 0), (3, 0), (3, 1), and (1, 1), oriented 
counterclockwise. 

Evaluate fo x*y dx — xy dy, where C is the curve with 
equation y? = x°, from (1, —1) to (1, 1). 

Evaluate J c y dx — x dy, where C is the portion of the 
parabola y = x’, from (3, 9) to (0, 0). 


Evaluate fo (x — y} dx + (x + yY dy, where C is the 
portion of y = |x|, from (—2, 2) to (1, 1). 


Evaluate F c xy? dx — xy dy, where C is the semicir- 
cular arc from (0, 2) to (0, —2) traveled clockwise. 


Find the circulation of F = (x? — y)i + (xy + x)j 
along the circle x? + y? = 16, oriented counterclock- 
wise. 


Evaluate fe yzdx —xzdy + xy dz, where C is the 
line segment from (1, 1, 2) to (5, 3, 1). 


Calculate Je zdx +x dy + y dz, where C is the curve 
obtained by intersecting the surfaces z = x* and 
x? + y? = 4and oriented counterclockwise around the 
z-axis (as seen from the positive z-axis). 


33. 


34. 


35. 


427 


6.1 | Exercises 


Show that f T -ds equals the length of the path x, 
where T denotes the unit tangent vector of the path. 


Tom Sawyer is whitewashing a picket fence. The 
bases of the fenceposts are arranged in the xy-plane 
as the quarter circle x* + y* = 25, x, y > 0, and the 
height of the fencepost at point (x, y) is given by 
h(x, y) = 10 — x — y (units are feet). Use a scalar 
line integral to find the area of one side of the fence. 
(See Figure 6.16.) 


Figure 6.16 The picket fence of 
Exercise 34. The base of the fence is the 
quarter circle x? + y? = 25,x,y>0. 


Sisyphus is pushing a boulder up a 100-ft-tall spiral 

staircase surrounding a cylindrical castle tower. (See 

Figure 6.17.) 

(a) Suppose Sisyphus’s path is described parametri- 
cally as 


x(t) = (Scos3t, 5sin 3t, 10t), O<t< 10. 


Figure 6.17 Sisyphus’s path up 
the spiral staircase of Exercise 35. 


If he exerts a force with a constant magnitude of 
50 lb tangent to his path, find the work Sisyphus 
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36. 


37. 


38. 


39. 


40. 


41. 
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does in pushing the boulder up to the top of the 
tower. 


(b) Just as Sisyphus reaches the top of the tower, he 
sneezes and the boulder slides all the way to the 
bottom. If the boulder weighs 75 lb, how much 
work is done by gravity when the boulder reaches 
the bottom? 


A ship is pulled through a 14-ft-long straight channel 
by a line that passes from the ship around a pulley. 
Assume that a coordinate system is set up so that the 
pulley is at the point (24, 32), and the ship is pulled 
along the y-axis from the origin to the point (0, 14). If 
the tension on the line is kept at a constant 25 Ib, find 
the work done in tugging the ship through the channel. 


Suppose C is the curve y = f(x), oriented from 
(a, f(a)) to (b, f(b)) where a < b and where f is 
positive and continuous on [a, b]. If F = yi, show that 
the value of fọ F + ds is the area under the graph of f 
between x = a andx = b. 


Let F be the radial vector field F = xi+ yj+zk. 
Show that if x(t), a <t <b, is any path that lies 
on the sphere x? + y? +z? =c?, then f F-ds=0. 
(Hint: Ifx(t) = (x(t), y(t), z(¢)) lies on the sphere, then 
KOP + bP + kk = c?. Differentiate this last 
equation with respect to t.) 


Let C be a level set of the function f(x, y). Show that 
IcVf+ds=0. 


You are traveling through Cleveland, famous for its 

lake-effect snow in winter that makes driving quite 

treacherous. Suppose that you are currently located 

20 miles due east of Cleveland and are attempting 

to drive to a point 20 miles due west of Cleveland. 

Further suppose that if you are s miles from the center 

of Cleveland, where the weather is the worst, you can 

drive at a rate of at most v(s) = 2s +20 miles per 
hour. 

(a) How long will the trip take if you drive on a 
straight-line path directly through Cleveland? (As- 
sume that you always drive at the maximum speed 
possible.) 


(b) How long will the trip take if you avoid the middle 
of the city by driving along a semicircular path 
with Cleveland at the center? (Again, assume that 
you drive at the maximum speed possible.) 

(c) Repeat parts (a) and (b), this time using v(s) = 
(s?/16)+ 25 miles per hour as the maximum 
speed that you can drive. 


Consider a particle of mass m that carries a charge q. 
Suppose that the particle is under the influence of both 
an electric field E and a magnetic field B so that the 
particle’s trajectory is described by the path x(t) for 


42. 


43. 


44. 


a < t <b. Then the total force acting on the particle 
is given in mks units by the Lorentz force 
F = q(E+Vv x B), 

where v denotes the velocity of the trajectory. 

(a) Use Newton’s second law of motion (i.e., F = ma, 
where a denotes the acceleration of the particle) 
to show that 

marv=qE-v. 

(b) Ifthe particle moves with constant speed, use part 
(a) to show that E does no work along the path 
of the particle. (Hint: Apply Proposition 1.7 of 
Chapter 3 to v.) 


Let C be the segment of the parabola y = x” be- 
tween (0, 0) and (1, 1) and consider the line integral 
te y? dx — x? dy. 

(a) Let xo = (0,0), xı = (4,4), x2 = (4,4), x3 = 
(3, iz) x4 = (1, 1). Use the trapezoidal rule for- 
mula (7) and these points to approximate the line 
integral. 


(b) Now calculate the exact value of the line integral 
and compare your results. 


Let C be the line segment from (0, 0, 0) to (1, 2, 3) 
and consider the line integral fe yz dx + (x + z)dy + 
xy dz. 

(a) Divide the segment into five regularly spaced 
points xo, X1, . . . , X4 and use the trapezoidal rule 
formula (7) to approximate this line integral. 

(b) Compare your approximation with the exact value 
of the line integral. 


Suppose that magnetic field measurements are made 
along a wire shaped as a curve C and the results are 
tabulated as follows: 


k Point x, = B(xk, Yk, zk) = 
(Xk, Yk: Zk) | Mi+Nj+Pk 
o| (-1,-2,-1 k 
1 (0, 1, —1) j+2k 
2) (0,2,0) i+j+2k 
3/ 2D 2i+j+2k 
4| (1.2.2) 2i +2) + 2k 
5| LJ 2i+3j+3k 
6| GLb 3i+3j +3k 
7. (1.0.0) 4i+ 3j +3k 
8 (0, 0, 0) 4i + 3j + 4k 


By writing f, B + ds as fẹ M dx + N dy + P dz, esti- 
mate the work done by B along C using 
(a) a trapezoidal rule approximation; 


(b) a trapezoidal rule approximation using only the 
points Xo, X2, X4, X6, Xg. 


Figure 6.18 The shaded region D 
has a boundary consisting of two 
simple, closed curves, each of class 
C!, whose union we call C. 


Figure 6.19 The region 
D of Example 1. 
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6.2 Green’s Theorem 


Green’s theorem relates the vector line integral around a closed curve C in R? 
to an appropriate double integral over the plane region D bounded by C. The 
fact that there is such an elegant connection between one- and two-dimensional 
integrals is at once surprising, satisfying, and powerful. Green’s theorem, stated 
generally, is as follows: 


THEOREM 2.1 (GREEN’S THEOREM) Let D be a closed, bounded region in R? 
whose boundary C = ð D consists of finitely many simple, closed, piecewise C! 
curves. Orient the curves of C so that D is on the left as one traverses C. (See 
Figure 6.18.) Let F(x, y) = M(x, y)i+ N(x, y)j be a vector field of class C! 
throughout D. Then 


dN OM 
f max + way = ff (Se -axa 
‘al D Ox dy 


(The symbol fe indicates that the line integral is taken over one or more closed 
curves.) 


EXAMPLE 1 LetF = xyi + y? jandlet D be the first quadrant region bounded 
by the line y = x and the parabola y = x?. We verify Green’s theorem in this case. 

The region D and its boundary are shown in Figure 6.19. 0D is oriented 
counterclockwise, the orientation stipulated by the statement of Green’s theorem. 
To calculate 


f Feds= § xy dx + ydy, 
dD dD 


we need to parametrize the two C! pieces of 0D separately: 


St al es A 
Ci: j 0<t<1 and C}: 0<t<l. 
y=t y=1-t 


(Note the orientations of Cı and C2.) Hence, 


$ sydx+ ydy = | xydx + yay + | xydx + y*dy 
aD Cı C2 


1 1 
=] (t-1° +17 -2t) dt4 Í (a-e + (1 — Adr) 
0 0 


1 1 
= E+ d f 2(1 — 1)°(—dt) 
0 0 


= (G+ H+ GA-k 
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On the other hand, 


1 x 1 
I [žo = Za) dxdy = | —x dy dx = l —x(x — x°)dx 
p Lox dy 0 Jx 0 


The line integral and the double integral agree, just as Green’s theorem says they 
must. + 


y EXAMPLE 2 Consider fe —y dx + x dy, where C is the circle of radius a (i.e., 
C the boundary of the disk D of radius a), oriented counterclockwise as shown in 
Figure 6.20. Although we can readily parametrize C and thus evaluate the line 

integral, let us employ Green’s theorem instead: 


: f -vax +xdy= S| z0- Žo] dxdy 


= I 2dx dy = 2(area of D) = 2ra’. 
Figure 6.20 The disk of 2 


radius a with boundary The rightmost expression is twice the area of a disk of radius a. In this case, the 
oriented so that Green’s double integral is much easier to consider than the line integral. + 
theorem applies. 


The use of Green’s theorem in Example 2 can be put in a much more general 
setting: Indeed, if D is any region to which Green’s theorem can be applied, then, 
orienting 0 D appropriately, we have 


if -ydx-+xdy=4 | f 2dx dy = area of D. (1) 
aD D 


Thus, we can calculate the area of a region (a two-dimensional notion) by using 
line integrals (a one-dimensional construction)! 


y EXAMPLE 3 Using formula (1), we compute the area inside the ellipse 
(0, b) x? /a? + y*/b* = 1 (Figure 6.21). 
The ellipse itself may be parametrized counterclockwise by 


0<t<2r. 


Once again, using formula (1), we find that the area inside the ellipse is 
Figure 6.21 The region on 
inside the ellipse 5 f —ydx+xdy= 4 / —b sin t(—a sin t dt) + a cos t(b cost dt) 
xja +y jb = 1. aD 0 
20 
1 | (absint + ab cos? t) dt 
0 


2n 
if abdt = mab. 
0 


4 D 


Figure 6.22 A plane region D in 
R°. The vector k is normal to D. 
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Alternative Formulations 


We rewrite the line integral-double integral formula appearing in the statement 
of Theorem 2.1 (Green’s theorem) in two ways. These reformulations generalize 
to higher dimensions and provide some additional insight in interpreting the 
geometric significance of Green’s theorem. 

To begin, consider a C! vector field 


F = M(x, y)it N(x, y)j 


to be defined on R? by taking its k-component to be zero. Then, if we compute 
the curl of F, we find 


i j k 
aN ƏM 
VxF=} 0/dx <d/dy d/dz = (2%) 
M N 0 


Therefore, because k- k = 1, we obtain 


J| C=- ffo xaa 


Since f,,, F -ds = fp Mdx + Ndy, Green’s theorem may be rewritten as 
follows: 


PROPOSITION 2.2 (A VECTOR REFORMULATION OF GREEN’S THEOREM) If D is 
a region to which Green’s theorem applies and 


F = M(x, y)it N(x, y)j 


is a vector field of class C! on D, then, orienting dD appropriately, 


$ F-as= | [ov xny-Kaa. 


To understand this result, visualize the plane region D as sitting in the xy- 
plane in R°. (See Figure 6.22.) The vector k is a unit vector normal to D, and 
Sf p(V x F):kdA is the double integral of the component of the curl of F normal 
to D. Since the line integral f p F» ds is the circulation of F along 0D (see 
86.1), the equation of Proposition 2.2 tells us that, under suitable hypotheses, the 
circulation of a vector field F along the boundary of a plane region is equal to 
the total (or net) “infinitesimal rotation” of F over the entire region. (See also 
§3.4 where the curl of a vector field is given an intuitive interpretation in terms of 
rotation—or wait until 7.3 when the notion of curl measuring rotation of a vector 
field is explained more precisely.) This result generalizes to Stokes’s theorem 
in RÌ. Stokes’s theorem relates the integral of the component of the curl of a 
three-dimensional vector field F that is normal to a surface in R? to the line 
integral of F over the boundary curves of the surface. 

Next, we reformulate Green’s theorem in another way. Once again, assume 
that D is a region in R? to which Green’s theorem applies and that its boundary 
curves are oriented appropriately. At each point along the C! segments of dD, 
let n denote the unit vector that is perpendicular to dD and points away from the 
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Figure 6.23 The outward unit 
normal n to the region D. 


region D. (We call n the outward unit normal vector to D. See Figure 6.23.) 
Then we can demonstrate the following: 


THEOREM 2.3 (DIVERGENCE THEOREM INTHE PLANE) If Disaregion to which 
Green’s theorem applies, n is the outward unit normal vector to D, and 


F = M(x, y)i+ N(x, y)j 


is a C! vector field on D, then 
f F-nds = |f V-FdaA. 
aD D 


PROOF Ifx(t) = (x(t), y(t)),a < t < b, parametrizes a C! segment of 3 D, then 
along this segment the unit vector n may be obtained geometrically by rotating 
the unit tangent vector x’(t)/ | x’(t) | clockwise by 90°. In particular, along such a 
parametrized C! segment, 


yOI-xOI _ yYOi- xi 
VP + yO |x| 


We calculate the line integral $, F- nds. Along each C! segment of dD, 
the contribution to the line integral may be evaluated as 


b 
f EEDD |x| a 


b are 
= Mews NOW, yj) 212 O4| 


a kol x’(1)|| dt 


b 
=] (M(x(t), YEY O — NEC), VEXO) dt 


a 


= [ -Ndx + May. 


Thus, by Green’s theorem, 


f vinden vars mina ff (D) a 
~ {Get a) 
= ff v-raa, 


by the definition of the divergence of F. m 


If C is a simple, oriented curve, the line integral f c F-nds, where n is the 
unit normal to C as defined in Theorem 2.3, is known as the flux of F across C. 
For example, if F represents the velocity vector field of a planar fluid, then the 
flux measures the rate of fluid transported across C per unit time. (We assume 
that F does not vary with time t.) 


(x, y) X F(x, y) Aten 


Figure 6.24 The shaded 
parallelogram has area 

F(x, y)At +n As, the approximate 
amount of fluid transported across 
the segment of C. 
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To see this, consider the amount of fluid transported across a small segment 
of C during a brief time interval At. As suggested by Figure 6.24, we have 


Amount of fluid transported ~ area of parallelogram 
x (F(x, y)At-njAs, (2) 


where As is the length of the segment of the curve C. Formula (2) is only ap- 
proximate, because the segment of curve need not be completely straight (so the 
parallelogram geometry is only approximate), and because the vector field F need 
not be constant (so the term F(x, y)At only approximates a flow line of F over the 
time interval). If we divide formula (2) by At, then the average rate of transport 
across the segment during the time interval At is (F(x, y) +n) As. If we now break 
up C into finitely many such small segments, sum the contributions of the form 
(F(x, y). n) As for each segment, and let all the lengths As tend to zero, we find 
that the average rate of fluid transport, denoted AM / At, is given approximately by 


AM 

— 7 | F-nds. 

At c 
Finally, letting At —> 0, we define the (instantaneous) rate of transport dM /dt 
to be 

dM 

a J F-nds, 

dt c 


which is the flux. 

In view of the remarks above, Theorem 2.3 tells us that the flux of F across 
the boundary of plane region D (i.e., what of F flows across ðD) is equal to 
the total (or net) divergence of F over all of D. We revisit the notion of flux in 
Chapter 7 in the case of a three-dimensional vector field F. In that setting, we are 
interested in the flux of F across a surface in RÌ; defining such a concept requires 
a surface integral. Then Theorem 2.3 generalizes to three dimensions as Gauss’s 
theorem (also called the divergence theorem). Gauss’s theorem relates the flux 
of a three-dimensional vector field F across a closed surface to the triple integral 
of the divergence of F over the solid region enclosed by the surface. 


Proof of Green’s Theorem 


We establish Green’s theorem (Theorem 2.1) in three major steps. The first two 
steps consist of proofs of special cases of Theorem 2.1. The third step is an outline 
of how the special cases may be used to provide a full proof of the general case. 
As a result, we fall short of a complete, rigorous proof of the very general version 
of Green’s theorem stated in Theorem 2.1. However, what we do prove makes use 
of the important geometric ideas of multiple integration and line integrals, and 
what we do not prove is rather technical. 


Step 1. We establish Green’s theorem when D is an elementary region in R? 
of type 3. Thus, D can be described in two ways (see Figure 6.25): 


D = {(x, y) E R? | yœ) < y < 6(x), a < x < b} 
= {(x, y) € R? | a(y) < x < BO), c < y <d}. 


The first description of D is as a type 1 elementary region; the second is as a type 
2 region. (Recall that a type 3 region is one that is of both type 1 and type 2.) We 
assume that the functions a, 6, y, and ô are all continuous and piecewise C!. 
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y=6(x) 


y=7(x) 


| 
| 
| 
l 
| 
| 
| 
i 
a 


Figure 6.25 A type 3 elementary region D analyzed as both a type 1 and 
type 2 region. Note the orientations of the boundary curves. 


Viewing D as a type 1 elementary region, we evaluate part of f} p Mdx + 
Ndy, namely, f, , Mdx. Note that ðD consists of a lower curve C; and an upper 
curve C2. If we parametrize these curves as follows: 


. x=t . 
aes a<t<b and c: | 


then C> is oriented opposite to the desired orientation shown in Figure 6.25. 
Bearing this in mind, we compute 


$, M(x, y)dx = J. M(x, y)dx — i Meid: 


(Note the minus sign!) Then 


b b 
$ Mo. ydx = f M(t, voar- | M(t, 5(t)) dt 
aD a a 


b 
= | ime, vo) - Me, sea. 


Now we compare the calculation of ¢,,,Mdx with that of f f —(8M/ðy) dA. 


We have 
aM b pè ƏM 
// -a= f l — —— dydx 
D dy a y(x) dy 


b 
= | [-M(Q, (x) + MQ, yx) dx 


a 


b 
= | ie. yo) - Me, ddz. 


(Note that the fundamental theorem of calculus was used here.) Thus, we see that, 
in this case, 


Figure 6.26 The region 

D= D; U Dz U D3 U Da, where 
each subregion D;, i = 1, 2, 3, 4, is 
elementary of type 3. 


Not part 
of aD 
Part of dD Part of dD 
\ 


Figure 6.27 The common 
boundary of subregions D; and Dj; 
is oriented one way as part of ð D; 
and the opposite way as part of 
dDj;. Hence, f,, Mdx + Ndy + 
fap, Mdx + Ndy will cancel over 
this common boundary. 
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In an analogous manner, we can show 


aN 
f Nay = Í —dA 
aD p Ox 


by viewing D as a type 2 elementary region. We omit the details, except to say 
that both the line integral and the double integral can be shown to be equal to 


d 
/ [N(B(), y) — N(a(y), y)] dy 


Finally, since D is simultaneously of type | and type 2, 


$, M(x, y)dx + N(x, yay =p maxi h N dy 


TE 
E 


Step 2. Now suppose that D is not an elementary region of type 3, but 
that it can be subdivided into finitely many type 3 regions Dı, D2,..., Dn in 
such a way that these subregions overlap at most two at a time and only along 
common boundaries. Such a region D would look something like the one shown in 
Figure 6.26. By Step 1, Green’s theorem holds for each subregion. Hence, we have 


/ oN — M,)dA = JI, (N, — M. pdas ff (N; — My)dA 


+: ot ff 0- M,)dA 


= M dx +N dy + M dx + N dy 
aDi aD» 


tot M dx + N dy. (3) 


At this point, it is tempting to conclude immediately that the sum of the line 
integrals in equation (3) is fp Mdx + Ndy. However, ðD; may contain more 
than only portions of ð D. The trick is to note that any portion of ð D; that is not 
part of 0D is part of exactly one other 0 D;. Moreover, this overlapping portion 
is given one orientation by ðD; and the opposite orientation by ð Dj. When we 
take the sum of the line integrals in equation (3), any contributions arising from 
the components of the ð D;’s that are in the interior of D will cancel in pairs. 
(See Figure 6.27.) Therefore, 


Tf (N; — My)dA = Mdx + Ndy + Mdx + Ndy 
D aD, 


ə Dı 


+o Mdx + Ndy 


-¢$ Mdx + Nady; 
aD 


Green’s theorem is established in this case. 
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Step 3. Unfortunately, not all regions described in the statement of Theo- 
rem 2.1 can be subdivided into finitely many elementary regions of type 3. Here 
is an outline of what we might do to prove Green’s theorem in such generality. 

First, we claim (without proof) that for regions D described in the statement 
of Theorem 2.1 we can produce a sequence of regions D1, D2, ..., Dn, ... whose 
“limit” asn — oo is D and such that each D, can be subdivided into finitely many 
type 3 elementary regions. Next, we claim that 0D, —> 0D as n — oo. Finally, 
we need to prove that, as n — oo, 


J| -maa = f| -maa 


f Mdx+Ndy — $ Mdx+Nay. 
aD, aD 


and 


Since Green’s theorem holds for each D, (by Steps 1 and 2), we are done.? & 


Historical Note® 


The idea that the line integral of a vector field along a closed curve can be related 
to a double integral over the region bounded by the curve is frequently attributed 
to George Green (1793—1841), a self-educated English mathematician. The result 
we have been calling Green’s theorem had its origins in a rather obscure 1828 
pamphlet published by Green, in which he sought to lay a rigorous mathematical 
foundation for the physics of electricity and magnetism. Green’s ideas arose from 
work in partial differential equations concerning gravitational potentials. Green’s 
pamphlet subsequently came to the attention of Lord Kelvin (1824-1907), who 
had it republished so that, fortunately, Green’s results received greater recognition. 

Coincidentally, a result similar to Green’s theorem was established indepen- 
dently (and also in 1828!) by the Russian mathematician Mikhail Ostrogradsky 
(1801-1861). Ostrogradsky’s name is sometimes associated to what we call 
Green’s theorem. 


In Exercises 1—6, verify Green's theorem for the given vector 2. F=(x*—y)i+(x+y’)j, D is the rectangle 


field 


bounded by x = 0, x = 2, y = 0, and y = 1. 


F = M(x, y)i + N(x, y)j 3. F= yi+x?j, D is the square with vertices (1, 1), 


and region D by calculating both 


(-1, 1), (-1, —1), and (1, —1). 


4. F = 2yi+ xj, D isthe semicircular region x? + y? < 


Mdx+Ndy and II (N; — My)dA. a’,y>0. 
2p p 5. F=3yi—4xj, D is the elliptical region x? + 
1. F = —x?yi + xy? j, D is the disk x? + y? < 4. 2y? < 4. 


2 For details of the type of limit argument we have in mind, see O. D. Kellogg, Foundations of Potential 
Theory (Springer, Berlin, 1929; reprinted by Dover Publications, New York, 1954), pp. 113—119, where 
a limit argument is given in the case of Gauss’s theorem, which we explore in §7.3. For a proof of Green’s 
theorem that avoids the limit argument, see D. V. Widder, Advanced Calculus, 2nd ed., (Prentice-Hall, 
Englewood Cliffs, 1961; reprinted by Dover Publications, New York, 1989), pp. 223-225. 

3 See also M. Kline, Mathematical Thought from Ancient to Modern Times (Oxford Press, New York, 
1972), p. 683. 


6. 


9. 


10. 


11. 


F = (x?y + x)i + ° —xy’)j, D is the region in- 
side the circle x? + y? = 9 and outside the circle 
x+ y? = 4. 


. (a) Use Green’s theorem to calculate the line integral 


§ y-dx + xdy, 
C 


where C is the path formed by the square with 
vertices (0, 0), (1, 0), (0, 1), and (1, 1), oriented 
counterclockwise. 

(b) Verify your answer for part (a) by calculating the 
line integral directly. 


. Let F = 3xyi + 2x? j and suppose C is the oriented 


curve shown in Figure 6.28. Evaluate 


f Feds 
C 


both directly and also by means of Green’s theorem. 


Figure 6.28 The oriented curve C of 
Exercise 8 consists of three sides of a 
square plus a semicircular arc. 


Evaluate 
§ (2 — dx +? + yay, 
C 


where C is the boundary of the square with vertices 
(0, 0), (1, 0), (0, 1), and (1, 1), oriented clockwise. Use 
whatever method of evaluation seems appropriate. 


Use Green’s theorem to find the work done by the vec- 
tor field 


F = (4y — 3x)i+ (@ — 4y)j 


on a particle as the particle moves counterclockwise 
once around the ellipse x? + 4y? = 4. 


Verify that the area ofthe rectangle R = [0, a] x [0, b] 
is ab, by calculating an appropriate line integral. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
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Let a be a positive constant. Use Green’s theorem to 
calculate the area under one arch of the cycloid 
x =a(t— sint), y =a(l — cost). 


Evaluate ¢.(x*+y> — 2y)dx + (3x + x°y*)dy, where 
C is the oriented curve pictured in Figure 6.29. 


Figure 6.29 The oriented curve C of 
Exercise 13. 


Use Green’s theorem to find the area enclosed by the 
hypocycloid 


x(t) = (a cos? t, a sin? t) O<t<2nz. 


(a) Sketch the curve given parametrically by x(t) = 
(=t e —1). 


(b) Find the area inside the closed loop of the curve. 


Use Green’s theorem to find the area between the 
ellipse x? /9 + y?/4 = 1 and the circle x? + y? = 25. 


Show that if D is a region to which Green’s theorem 
applies, and ð D is oriented so that D is always on the 
left as we travel along dD, then the area of D is given 
by either of the following two line integrals: 


Area of D = xdy=—$ ydx. 
aD aD 


Find the area inside the quadrilateral whose vertices 
taken counterclockwise are (2, 0), (1, 2), (—1, 1), and 
d, 1). 


Suppose that the successive vertices of an n-sided poly- 
gon are the points (a), b1), (a2, b2), .. . (dn, bn), ar- 
ranged counterclockwise around the polygon. Show 
that the area inside the polygon is given by 


1 4 fan bn 
2 a, bil}? 


Let a be a positive integer throughout this problem. 
An epicycloid is the path produced by a marked point 
on a circle of unit radius that rolls, without slipping, 


a bz 
az b3 


a bi 
a bz 


Qn-1 bn-1 
an bn 
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21. 


22. 


23. 


24. 
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on the outside of a fixed circle of radius a. If the cen- 

ter of the fixed circle is at the origin and the marked 

point is at (a, 0) when t = 0, then the epicycloid is 

given by the path x(t) = ((a + 1) cost — cos (a + 1)t, 

(a + 1)sint — sin (a + 1)t). (See Exercise 35 of the 

Miscellaneous Exercises for Chapter 1.) 

(a) Show that the epicycloid path meets the fixed cir- 
cle exactly when ¢ = 27rn/a, where n is an integer. 
(Hint: This must happen when ||x(t)|| = a.) Graph 
the epicycloid when a = 5, 6. 


(b) Use an appropriate line integral to find the area 
enclosed by the epicycloid. 


(c) As the integer a gets larger, what happens to the 
ratio of the area calculated in part (b) to that of the 
fixed circle? 


Evaluate the line integral fo 5y dx — 3x dy, where C 
is the cardioid with polar coordinate equation r = 
1 — sin, oriented counterclockwise. 


(a) Suppose that C is a simple, closed curve that does 
not enclose the origin. Use Green’s theorem to de- 
termine the value of 


$ xdx + ydy 
c x2 4 y2 
(b) Now suppose that C is a simple, closed curve that 


does enclose the origin. Can you use Green’s the- 
orem to determine the value of 


gee. 
č Faye 


(c) Let Cı and Cz be two simple, closed curves that 
both enclose the origin, are both oriented coun- 
terclockwise, and do not touch or intersect. Show 
that 


Explain. 


$ xdx + ydy -¢$ xdx+ydy 

a ay Ja Hy 

(d) Use the result of part (c) to determine the value of 

$ xdx + ydy 
g ay 

where C is a simple, closed curve that encloses the 
origin. 

(a) Use the divergence theorem (Theorem 2.3) to show 
that $- F - nds = 0, where F = 2y i — 3x j and C 
is the circle x? + y? = 1. 

(b) Now show $o F- nds = 0 by direct computation 
of the line integral. 


Let F = M(x, y)i+ N(x, y)j. The divergence theo- 
rem shows that the flux of F across a closed curve C 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


(i.e., fo F + mds) is equal to ff (M, + Ny) dA, where 
D is the region bounded by C. Use Green’s theorem 
to establish a similar result involving f- F » T ds, the 
circulation of F along C. (See also §6.1.) 


Let C be any simple, closed curve in the plane. Show 
that 


§ 3x7y dx +x3 dy =0. 
c 
Show that 
§ —y dx + (x? + 2x + y)dy 
Ç 
is positive for any closed curve C to which Green’s 
theorem applies. 
Show that if C is the boundary of any rectangular re- 
gion in R?, then 
fpo — 3y)dx + x°y dy 
C 
depends only on the area of the rectangle, not on its 
placement in R°. 


Let r = xi + yj be the position vector of any point 
in the plane. Show that the flux of F = r across any 
simple closed curve C in R? is twice the area inside C. 


Let D be a region to which Green’s theorem applies 
and suppose that u(x, y) and v(x, y) are two functions 
of class C? whose domains include D. Show that 


[hei fura 


where C = 0D is oriented as in Green’s theorem. 


Let f(x, y) be a function of class C? such that 
Pf PF 


es: EREN 

əx? ay? 
(i.e., f is harmonic). Show that if C is any closed curve 
to which Green’s theorem applies, then 

a ð 

A F ax = l y =0. 

dy Ox 
Let D be a region to which Green’s theorem applies 
and n the outward unit normal vector to D. Suppose 
f(x, y) is a function of class C?. Show that 


J| rag of as, 


where Vf denotes the Laplacian of f (namely, 
Vf =a fJðx? +07 f/dy?) and df/dn denotes 
V f -n. (See the proof of Theorem 2.3 for more in- 
formation about n.) 


A C 


Figure 6.30 IfF has path- 
independent line integrals, then 
Jc, F+ ds = fc, F + ds for any two 
piecewise C! oriented curves from 
A to B. 


y (1,1) 


Figure 6.31 The curves 
Cı and C2 of Example 1. 
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6.3 Conservative Vector Fields 


As seen in §6.1, line integrals ofa given vector field depend only on the underlying 
curve and its orientation, not on the particular parametrization of the curve. In 
some special instances, however, even the curve itself doesn’t matter, only the 
initial and terminal points. A vector field having the property that line integrals of 
it depend only on the initial and terminal points of the oriented curve over which 
the line integral is taken is said to have path-independent line integrals. We next 
state a more careful definition, characterize such vector fields, and explore their 
significance. 


Path Independence 


DEFINITION 3.1 A continuous vector field F has path-independent line 


integrals if 
if F-ds= i F-ds 
Ci C2 


for any two simple, piecewise C!, oriented curves lying in the domain of F 
and having the same initial and terminal points. (See Figure 6.30.) 


EXAMPLE 1 Let F = yi— xj and consider the following two curves in R? 
from the origin to (1, 1): C4, the line segment from (0, 0) to (1, 1), and C3, the 
portion of the parabola y = x? (the curves are shown in Figure 6.31). These curves 
may be parametrized as 


y=t 


Then we calculate 


1 1 
[ Fas= fci-rp-atnar= f Odt =0, 
Ci 0 0 


x=fď x = 
ee 0<t<1 and c: | -p 0<t<l. 


while 
1 
/ F-ds = @i-tj-ü+2tj)dt 
C 0 
: 1 
= (0? — 217) dt = —48|, = —-}. 
0 
We see that 
/ F-ds# F -ds, 
C1 Co 
and so F does not have path-independent line integrals. + 


EXAMPLE 2 Let F = xi+ yj. This vector field has path-independent line 
integrals—we will see why presently. For the moment, however, we illustrate (not 
prove) this fact by considering the parabolic path x: [0, 1] > R?, x(t) = (t, t°), 
as well as the path y: [0, 2] > R? made up of the two straight segments 


yi:[0, 1] > R?, yi(t)=(0,t) and yp:[1,2] > R’, yo(t) = (t — 1, 1). 
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y2 (1,1) 


yı x 


Figure 6.32 The paths x 
and y (consisting of the 
paths yı and y2) of 
Example 2. 


Figure 6.33 The simple, 
closed curve C consists of 
two oriented curves from A 
to B. 


Both x and y are paths from (0, 0) to (1, 1) and are shown in Figure 6.32. We have 


1 
[rae | (ti+¢?j)-(i+ 2rjyadt 
x 0 


1 
=| (t + 2t7)dt = (47? + 444) = 1, 


and 


m 


F-as= | F-as+ | F-ds 
yı y2 


1 2 
=f niid f ((t — l)i+ j)-idt 
0 1 


1 2 
f! rar f dar 
0 1 


1 2 
jh + je- wh=b tbe 


II 


To establish that F has path-independent line integrals, we would need to check 
that the value of the line integral of F along any choice of path between any two 
points is the same as any other—a prohibitive task. + 


The following result is a reformulation of the path-independence property: 


THEOREM 3.2 Let F be a continuous vector field. Then F has path-independent 
line integrals if and only if fe F -ds = 0 for all piecewise C!, simple, closed 
curves C in the domain of F. 


PROOF First, assume that F satisfies the path-independence property. Suppose 
C is parametrized by x(t), a < t < b, where x(a) = x(b) = A. Let B be another 
point on C, and break C into two oriented curves Cı and C2 from A to B. One 
of these curves—say, Cı—will be oriented the same way as C and the other the 
opposite way. (See Figure 6.33.) Thus, 


fr-as= Fas- | F-ds=0, 
C Cj C2 


since F has path-independent line integrals. 

Conversely, suppose that all line integrals of F around simple, closed curves 
are zero. Then given two piecewise C!, oriented, simple curves C, and C with 
the same initial and terminal points, let C be the closed curve consisting of Cı 
and —C} (1.e., C2 with its direction reversed). Then, we have 


frais=) F-as+ | F-as= | F-ds— | F- ds. 
E Li —-C2 Cy Cz 


If C happens to be simple, then fe F -ds = 0 by assumption, so 


[ Fas= | F -ds, 
Ci C2 


Cı 


C 


Figure 6.34 The closed curve C 
constructed from C; and the 
reverse of C2 need not be simple. 
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as desired. However, C need not be simple even if Cı and C2 are. (See Figure 6.34.) 
If it is possible to break Cı and C3 into finitely many segments so that a segment 
C; of Cı and a segment C) of C3 either (i) completely coincide or (ii) together 
form a simple, closed curve C’, then it is not too difficult to modify the preceding 


argument to conclude that 
/ F-ds= J F-ds. 
Ci Co 


However, it is not always possible to do this, and further technical arguments 
(which we omit) are required. a 


We remark that it is not essential to assume that the curves in Definition 3.1 
and Theorem 3.2 are simple. We have done so here in order to make the proof of 
Theorem 3.5 below more straightforward. 


Gradient Fields and Line Integrals 


We describe next a class of vector fields that satisfy the path-independence prop- 
erty, namely, gradient fields. Suppose that F is a continuous vector field such that 
F = V f, where f is some scalar-valued function of class C!. (Recall that we 
refer to f as a scalar potential of F. We also call F a conservative vector field 
as well as a gradient field). Then, along any path x from A = x(a) to B = x(b) 
whose image lies in the domain of F, we have 


[reas [vs-as= f vreo xoa 


a 


It follows from the chain rule that d/dt[ f (x(t))] = V f (x(t)): x(t). Hence, 


b b d 
J Vf-ds= J VFO- x Adi = / OL 


a a 


= FAA]? = f(x(b)) — f(x(a)) = f(B) = f(A). 


Therefore, when F is a gradient field, the line integral of F depends only on the 
value of the potential function at the endpoints of the path. Hence, gradient fields 
have path-independent line integrals. The converse holds as well, as we prove at 
the end of this section. Stated formally, we have the following theorem: 


THEOREM 3.3 Let F be defined and continuous on a connected, open region R 
of R”. Then F = V f (where f is a function of class C! on R) if and only if F has 
path-independent line integrals over curves in R. Moreover, if C is any piecewise 
C!, oriented curve lying in R with initial point A and terminal point B, then 


[ F-ds = f(B) — f(A). 


Note: A region R C R” is connected if any two points in R can be joined by 
a path whose image lies in R. 


EXAMPLE 3 Consider the vector field F = x i+ y j of Example 2 again. You 
can readily check that F = V f, where f(x, y) = 5 (x? + y*). By Theorem 3.3, 
line integrals of F will be path independent; this fact was illustrated, but not 
proved, in Example 2 when we calculated the vector line integral of F along two 
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paths from (0, 0) to (1, 1). By Theorem 3.3, we see now that for any directed 
piecewise C! curve C from (0, 0) to (1, 1), we have 


[Peas=sa. 1)— f(0,0) = į? +15- 400° + 0?) = 1, 
C: 


which agrees with our earlier computations. + 


A Criterion for Conservative Vector Fields 


Theorem 3.3 tells us that a vector field F has path-independent line integrals 
precisely when it is a conservative (gradient) vector field and, moreover, that the 
line integral of F along any path is determined by the values of the potential 
function f at the endpoints of the path. Two questions arise naturally: 


1. How can we determine whether a given vector field F is conservative? 


2. Assuming that F is conservative, is there a procedure for finding a scalar 
potential function f such that F = V f? 


We answer the first question by providing a simple and effective test that can 
be performed on F. Should F pass this test (1.e., if F is conservative), then we 
illustrate via examples how to produce a scalar potential for F, thereby answering 
the second question. 

First, we need additional terminology. 


DEFINITION 3.4 A region R in R? or R? is simply-connected if it con- 
sists of a single connected piece and if every simple, closed curve C in R 
can be continuously shrunk to a point while remaining in R throughout the 
deformation. 


If R is a region in the plane, then R is simply-connected just in case it is 
connected and every simple, closed curve C lying in R has the property that all 
the points enclosed by C also lie in R. Loosely speaking, a simply-connected 
region (in either R? or R*) can have no “essential holes.” Illustrative examples are 
shown in Figures 6.35 and 6.36. The notion of continuously shrinking a curve to a 
point can be made fully precise, although we shall not take the trouble to do so here. 


(1) (2) 


Figure 6.35 (1) The region R; C R? is simply-connected: All points 
surrounded by any simple, closed curve in R; lie in R4. (2) In contrast, R2 is not 
simply-connected: Although the curve Cı encloses points that lie in R3, the 
curve C2 surrounds a hole. Hence, Cz cannot be continuously shrunk to a point 
while remaining in R2. 
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(1) n (2) f 


aie Ee 


x R, = R° — {(0, 0, 0)} x R, =R? - z-axis 


Figure 6.36 (1) The region Rı C R? is simply-connected. (2) The region R3 is not 
simply-connected: The curve C cannot be shrunk continuously to a point without 
becoming “stuck” on the “missing” z-axis. 


Now we state our criterion for a vector field to be conservative. 


THEOREM 3.5 Let F be a vector field of class C! whose domain is a simply- 
connected region R in either R? or R°. Then F = V f for some scalar-valued 
function f of class C? on R if and only if V x F = 0 at all points of R. 


Before proving Theorem 3.5, some remarks and examples are appropriate. 
First, note that Theorem 3.5 provides a straightforward way to determine ifa vector 
field F is conservative: Check that the domain of F is simply-connected, and then 
test if V x F = 0. If the curl vanishes, it follows that F has path-independent line 
integrals. This “curl criterion” can be helpful in practice. 

Inthe case where F = M(x, y)i+ N(x, y)jisatwo-dimensional vector field, 
the condition that the curl of F vanishes means 


i j k 
aN 3M 
VxF=| 3/əx a/dy 0/dz a )k=0 
M(x,y) N(x, y) 0 


This is equivalent to the condition 
ƏN ƏM 
əx dy” 


Equation (1) is a simpler condition to use in this situation. 


(1) 


EXAMPLE 4 Let F = x?yi— 2xy j. Then 
ð ð 
—(—2xy)= —2y and —(x?y) =x’, 
Ox dy 


Since these partial derivatives are not equal, we conclude that F is not conservative, 
by Theorem 3.5. + 


EXAMPLE 5 Let F = (2xy + cos 2y)i + (x? — 2x sin 2y)j. The vector field 
F is defined and of class C! on all of R? (a simply-connected region). Moreover, 


ə ð 
n” — 2x sin 2y) = 2x — 2sin2y and TA ey) = ae —2sin2y. 
x y. 
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(0, 0, 1) 


(1,0, 0) 


x 


Figure 6.37 The paths 

x(t) = (1-1, sinzt,t),0<t<1 
and y(t) = (cost, 0, sint), 
0<t<z7/2. 


We may conclude that F is conservative. In addition, if C is the ellipse 
x?/4 + y? = 1 (a simple, closed curve), then, by Theorems 3.2 and 3.3, we con- 
clude, without any explicit calculation, that fe F-ds=0. + 


EXAMPLE 6 Let 


X : y ‘ Z 
= ox ) i+ ! 
Css i ery tel Preys? 


F is of class C! on all of R? except for the origin. Note that R? — {(0, 0, 0)} is 
simply-connected. We leave it to you to check that V x F = 0 for all (x, y, z) in 
the domain of F. Therefore, by Theorem 3.5, F is conservative. 

Now suppose x: [0, 1] — R? is the path given by x(t) = (1 — t, sin 7t, t). To 
evaluate f F - ds directly, we must calculate 


l 1—t inst 
[r-as=[( a 6(1 — 1), 2 
o \( = t} + sinnt + t? (1 — t} + sin ant +t? 


t 


(1 — t} + sin? wt +t? 


K 1, x coszt, 1)dt 


1 : 
al (= —1+asinztcosmt -6a n) de 
o \ (A-t + sin att 2? 

This last integral is tricky to evaluate. However, since F is conservative, we may 
evaluate F by calculating iF -ds, where y is any other path with the same 
endpoints as x. A good choice is y(t) = (cost, 0, sint), 0 < t < 2/2, because 
the image of this path lies on the sphere x? + y? + z? = 1, a fact that will en- 
able us to work with a simple integral. (See Figure 6.37 for a graph of the two 
paths x and y.) Since F is conservative (and hence has path-independent line 
integrals), 


aie t 0 sint 
[v-as= [r-as= f (= — 6cost, —, ot) sinr, 0, cost) dt 
: ; : 1 ila 


m/2 m/2 
=i 6costsinrdi = f 3 sin 2t dt 
0 0 


m/2 


= —łġ cos 2th = —3(-1- 1) $3. 


Sketch of a proof of Theorem 3.5 By Theorem 4.3 of Chapter 3, note that, if 
F = Vf for some function f of class C?, then V x F = V x (Vf) = 0. 

Conversely, suppose that V x F = 0. We show that if C is any piecewise C!, 
simple, closed curve in R, then fe F - ds = 0. By Theorem 3.2, this implies that 
F has path-independent line integrals, which, by Theorem 3.3, is equivalent to 
F’s being the gradient of some scalar-valued function f. Moreover, since F is 
assumed to be of class C!, it follows that f must be of class C?. 


CD 


Figure 6.38 Since R is 
simply-connected, any region D 
enclosed by C must lie in R. 


C 


Figure 6.39 A 
surface in space, 
bounded by the 
simple, closed 
curve C. 
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To see that fo F-ds = 0, suppose, first, that F is defined on a simply- 
connected region R in R’. Since R is simply-connected, the closed curve C 
bounds a region D that is entirely contained in R. (See Figure 6.38.) By 
Proposition 2.2, which is equivalent to Green’s theorem, we have 


frais= f| oxra == f] oda =o. 


We use the “+” sign in the event that C is oriented opposite to the orientation 
stipulated by Green’s theorem. If F is defined on a simply-connected region R 
in R3, then we must apply Stokes’s theorem rather than Green’s theorem. This 
gets us a little ahead of ourselves, although the principle remains the same: If C 
is a simple, closed curve in R C R°, then fe F - ds is equal to a suitable surface 
integral of V x F over a surface in R bounded by C. (See Figure 6.39.) Because 
the curl is assumed to be zero, any integral of it will be zero, and so fọ F- ds is 
zero as well. 


Finding Scalar Potentials 


Now that we have a practical test to determine whether a given vector field F 
is conservative, we illustrate in Examples 7 and 8 a straightforward method for 
producing a scalar potential function for F. This technique is a direct consequence 
of the definition of a gradient field. 


EXAMPLE 7 Consider the vector field 
F = (2xy + cos2y)i+ (x? — 2x sin2y)j 


of Example 5. We have already seen that F is conservative in Example 5. To find 
a scalar potential for F, we seek a suitable function f(x, y) such that 


ð 0 
væ y= i+ jar 
Ox dy 
The components of the gradient of f must agree with those of F; therefore, 
a 
a = 2xy + cos2y 
Ox (2) 
0 
ws = x? — 2x sin2y 
dy 


We may begin to recover f by integrating the first equation of (2) with respect to 
x. Thus, 


f@,y= J2 dx = [oxy + cos2y yas = x’y + xcos2y+ g(y), (3) 


where g(y) is an arbitrary function of y. (The function g(y) plays the role of the 
arbitrary “constant of integration” in the indefinite integral of 0f/0x.) Differen- 
tiating equation (3) with respect to y yields 

ð 

Ta x? — 2x sin 2y + g'(y). (4) 

dy 
If we compare equation (4) with the second equation of (2), we see that g’(y) = 0, 
and so g must be a constant function. Therefore, our scalar potential must be of 
the form 


f(x, y) =x’y +x cos2y +C, 
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where C is an arbitrary constant. You may, if you wish, double-check that 
Vf =F. + 


EXAMPLE 8 LetF = (e* sin y — yz)i + (e* cos y — xz)j + (z — xy) k. Note 
that F is of class C! on all of R*. We calculate 


i j k 
VxF= a/ax 3/ðy ð/Əðz 


e*siny— yz e*cosy—xzZ zZz—xy 


0 0. 
= (z — xy) (e* cos y — xz) Ji 
dy dz 


3. ð . 
F (že sin y — yz) — (ze — »)i 
0z Ox 


O nf 0 

+ | —(e’ cos y — xz) — —(e’ sin y — yz) | k 
Ox dy 

=0. 


Therefore, by Theorem 3.5, F is conservative. 
Any scalar potential f(x, y, z) for F must satisfy 


of a 

— =e*siny—yz 

Ox 

0 

a oe (5) 
dy 

of 

a er 

oz z ” 


Integrating df/dx with respect to x, we find that 


0 
fayd = f Las 


= fe sin y — yz)dx 


=e* sin y — xyz + g(y, z), (6) 


where g(y, z) may be any function of y and z. Differentiating equation (6) with 
respect to y and comparing with the second equation in (5), we see that 


of ` dg 7 
=e cosy—xzZ+ — =e COSy— xz. 
dy dy 


Hence, dg/dy = 0, so g must be independent of y; that is, g(y, z) = A(z), a 
function of z alone. So 


f(x, y,z) =e" sin y — xyz + h(z). (7) 


Figure 6.40 If B’ is 
sufficiently close to B, then the 
straight-line path from B to B’ 
will lie inside R. 
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Finally, we differentiate equation (7) with respect to z and compare with the third 
equation of (5): 


a 
ws —xy +h'(z) =z — xy. 
əz 


Therefore, h'(z) = z, so h(z) = 52 + C, where C is an arbitrary constant. Thus, 
a scalar potential for the original vector field F is given by 


f(x,y, z) =e" sin y — xyz + $z + C. é 


Addendum: Proof of Theorem 3.3 


Recall that we have already shown that if a vector field F is a gradient field, then F 
has path-independent line integrals. So now we need only establish the converse. 
We do this explicitly in the case where F is defined on a (connected) subset R of 
R?, although our proof requires only notational modification in the n-dimensional 
setting. 

Assume that F has path-independent line integrals. Then we may unambigu- 
ously write sÈ F - ds to denote the vector line integral of F from the point A 
to the point B along any path whose image lies in R. In what follows, consider 
A(X0, Yo, Zo) to be a fixed point in R, and B(x, y, z) a “variable point.” Then we 
define 


B 
fay= f F-ds 


and show that f is a scalar potential for F. 
Write F explicitly as 


F= M(x, y,z)i+ N(x, y,z)j + PO, y, z)k. 


Therefore, we need to verify that the components of V f agree with those of F; 
that is, 
af 


0 0 
=~ = M(x, y, 2), OY Neni: and CA 
Ox dy Oz 


Actually, we check only the first of these equations; the others may be verified in 
a similar manner. 
At the point B, we have, by the definition of the partial derivative, that 
af Etan fey.) fB)- FB) 


= li li 
Ox h>0 h h>0 h 


(8) 


where B’ denotes the point (x + h, y, z). Note that B’ > B as h - 0. The nu- 
merator of the difference quotient in equation (8) is 


p(B) — f(B)= f Rods f Fds= [Peas (9) 


Ifh is sufficiently small, we may evaluate the line integral in equation (9) by using 
the straight-line path between B and B’. (See Figure 6.40.) Explicitly, this path is 


x(t)=(x+th,y,z), O<t<1. 
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6.3 Exercises 


1. Consider the line integral fo z? dx +2ydy+xzdz. 
(a) Evaluate this integral, where C is the line segment 


from (0, 0, 0) to (1, 1, 1). 


Then 


B' 1 1 
J F-as= | F(x + th, y, z)-(A, 0, oar= | hM(x + th, y, z)dt. 
B 0 0 


Since f is between 0 and 1, |th| < |A|. By the continuity of F (and therefore M) 
we have, for h ~ 0, 


M(x + th, y,z) ¥ M(x, y, 2). 


This approximation improves as h — 0. Hence, 


B' 1 
sE- f= f F-ds ~f hM(x, y,z)dt =hM(x, y, z). 


Using this result in equation (8), we see that 


of eck 
lim — 
h>0 h 


X 


[hM(x, y,z)] = M(x, y, 2), 


as desired. 


8. F = (6xy? + 2y°)i + (6x?y — xy)j 
9. F = (6xy? — 3x?) i + O? + 6x?y)j 


= 3 Dha 2,3 2 : 
(b) Evaluate this integral, where C is the path 10. F = (xyz T xy 5 2 Vit (2x°2" — y“ + 2yz)j 
from (0,0,0) to (1,1,1) parametrized by + (6x"y — y°z) k 
r=. USF = 1, 11. F = (4xyz3 — 2xy)i + (2x2z3 — x? + 2yz)j 
(c) Is the vector field F = z? i + 2y j + xz k conser- + (6x? yz" + y’)k 
ive? 9 
vative? Why or why not? 12. F = (2xz = y? + yze™)i = (2xy 4 xze)j 
2. Let F = 2xyi + (x? + 2*)j+2yzk. + (x2 + xye)k 
(a) Calculate f, F- ds, where C is the path param- : : 
etrized by x(t) = (£2, £3, 15), 0 < £ < 1. 13. F = (2x + y)i + (z cos yz + x) j + (y cos yz)k 
(b) Calculate he F + ds, where C is the straight-line 14. F=(y+z)it2zj+@ty)k 
path from (0,0, 0) to (1, 0,0), followed by the 46. F = e” sin yi + e” cos yi Ge + DE 
straight-line path from (1, 0, 0) to (1, 1, 1). a k PRIPP EA 
(c) Does F have path-independent line integrals? Ex- 16. F=3x?i+ Ž j +2zInyk 
plain your answer. y 
17. F = (e — yze Ji + xz(e + e®)j 
In Exercises 3—17, determine whether the given vector field + xy(e™ — e)k 
F is conservative. lf it is, find a scalar potential function for F. 
18. Of the two vector fields 
3. Fa=eVit+e”j 
Fa 5 12 trek 
4. F = 2xsinyi+ x* cosyj EE ae 
and 
= š 2 . 2 
5. F= (3:7c0sy + 55 Ji G = 2xyi + (x + 2yz)j + y“k, 
X 
a one is conservative and one is not. Determine which 
4 ( sin y + x ) j is which, and, for the conservative field, find a scalar 
1+ x2y? potential function. 
å xy? xy 19. (a) Let f bea function of class C! defined on a con- 


r l . 
UE T 1422 


7. F = (e — ysinxy)i— (xe™” + x sin xy)j 


nected domain in R”. Show that if the gradient of 
f vanishes at all x = (x1, .. 
then f is constant. 


., Xn) in its domain, 


20. 


21. 


22. 


23. 


24. 


25. 


(b) Suppose that F is a conservative vector field de- 
fined on a connected subset of R”. Show that if g 
and h are both class C! potential functions for F, 
then g and h must differ by a constant. 


Find all functions M(x, y) such that the vector field 
F = M(x, y)i + (x sin y — ycosx)j 
is conservative. 
Find all functions N(x, y) such that the vector field 
F = (ye™ +3xe)i+ N(x, y)j 
is conservative. 


Let G(x, y) = (ve* + y?)i+ xy j. Find all functions 
g(x) such that the vector field F = g(x)G is conserva- 
tive on all of R°. 


Find all functions N(x, y, z) such that the vector field 
F = (xy — 3x°z)i + N(x, y, z)j + (2yz — x°) k 
is conservative. 


For what values of the constants a and b will the vector 
field 


F = (3x? + 3y°z sin xz)i + (ay cos xz + bz) j 


+ Bxy? sinxz + 5y)k 
be conservative? 
Let F = x? i + cos y sin z j + sin y cos z k. 


(a) Show that F is conservative and find a scalar po- 
tential function f for F. 


(b) Evaluate f F -ds along the path x: [0, 1] > Rê, 
x(t) = (£ + 1, e', e”). 


Show that the line integrals in Exercises 26—28 are path inde- 
pendent, and evaluate them along the given oriented curve and 
also by means of Theorem 3.3. 


26. 


27. 


28. 


fe — 5y)dx + (7y — 5x) dy; C is the line segment 
C 


from (1, 3) to (5, 2). 
xdx + ydy. 
c Vx2+y2 ’ 


y? = 4 from (2, 0) to (—2, 0). 


C is the semicircular arc of x? + 


[eo — 3z)dx + (2x + z)dy + (y — 3x) dz; C isthe 
c 


line segment from the point (0, 0, 0) to (0, 1, 1) fol- 
lowed by the line segment from the point (0, 1, 1) to 
(1;2,3). 


In Exercises 29-32, find the work done by the given vector field 
F in moving a particle from the point A to the point B whose 
coordinates are as indicated. 


29. 
30. 
31. 


32. 


33. 


34. 


35. 


36. 


37. 
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F = Bx?y — y*)it+ (x — 2xy) j; A(0, 0), B(2, 1) 
F = 3/xyi + 2x? j; ACL, 2), B(9, 1) 


F = (2xyz — y’z’) i + (x°z — 2xyz°)j 
+ (x?y — 3xy?z?) K; A(1, 1, 1), B(6, 4, 2) 


F = 2xy cos zi + x? cosz j — x?y sin z k; 
AG, 1, 7/2), B(2, 3, 0) 


(a) Determine where the vector field 


x+xy?, x +l, 
1 J 


is conservative. 
(b) Determine a scalar potential for F. 


(c) Find the work done by F in moving a particle along 
the parabolic curve y = 1 + x — x? from (0, 1) to 


(1, 1). 


Let f, g, and h be functions of class C! of a single 
variable. 


(a) Show that F =(f(x)+y+z)i+ (x +8gy)+ 
z)j + (x + y + h(z))k is conservative. 


(b) Determine a scalar potential for F. (Your answer 
will involve integrals of f, g, and h.) 


(c) Find Sef - ds, where C is any path from 
(xo, Yo, Zo) to (x1, 1, Z1). 


Consider the vector field 
F = (2x + z)cos(x? + xz)i — (z + 1)sin(y + yz)j 
+ (x cos (x? + xz) — y sin (y + yz))k. 
(a) Determine if F is conservative. 
3 42 „Ti 
(b) Ifx(t)= (t,t (TeS ,0<t<1, evalu- 
ate f F- ds. 
Consider the vector field 


G = (2x + z) cos (x° + xz)i 
+(x -= (z+ Dsin(y + yz))j 
+ (x cos (x? + xz) — y sin (y + yz))k. 


(a) How is G different from the vector field F in Ex- 
ercise 35? Is G conservative? 


t 
(b) If x(t) = G t?, wt — sin +), 0<t<1, evalu- 
ate f, G-ds. 


Let F be the gravitational force field of a mass M ona 
particle of mass m: 


GMm 
(x? +y? + 22)3/ 


x (xit+yj+zk). 
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10. 


11. 


12. 
13. 


14. 
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(This is the force field of Example 3 in §3.3.) Given 
that G, M, and m are all constants, show that the 
work done by F as a particle of mass m moves from 


. IfC isthe parabola y = 4 — x? with —2 < x < 2, then 


Jey sinx ds = 0. 


. If F = —i + j + k and C is the straight line from the 


origin to (2, 2, 2), then fo F + ds = 2V3. 


. If F = xi + yj + zk and C is the straight line from 


(3, 3, 3) to the origin, then Je F - ds is positive. 


. Suppose that f(x) > 0 for all x. Let F = f(x)i If C 


is the horizontal line segment from (1, 1) to (2, 1), then 
Jc F-ds > 0. 


. Suppose that f(x) > 0 for all x. Let F = f(x)i If C 


is the vertical line segment from (0, 0) to (0, 3), then 
Jc F-ds > 0. 


. Ifx is a unit-speed path, then f, F » ds = Ler -v)ds, 


where v denotes the velocity of the path. 


. If x and y are two one-one parametrizations of the 


same curve and F is a continuous vector field, then 
J, F-ds= f,F+ds. 


. Ifa nonvanishing, continuous vector field F is every- 


where tangent to a smooth curve C, then F does no 
work along the curve. 


. Ifa nonvanishing, continuous vector field F is every- 


where normal to a smooth curve C, then F does no 
work along the curve. 


If the curve C is the level set at height c of a func- 
tion f(x, y), then Je f(x, y)ds is c times the length 
of C. 


If f(x,y,z) is a continuous function and 
fe f(x, y,z)ds = 0 for all curves C in R, then 
f(x, y, z) = 0 for all (x, y, z) € RÈ. 


If a closed curve C is a level set of a function f(x, y) 
of class C! and V f Æ 0, then the flux of V f across C 
is always zero. 


If a closed curve C is a level set of a function f(x, y) 
of class C! and V f Æ 0, then the circulation of V f 
along C is always zero. 


Ifa vector field F has constant magnitude 3 and makes 
a constant angle with a curve C, then the work done 
by F along C is 3 times the length of C. 


15. 


16. 


17. 


18. 


19. 
20. 


21. 


22. 


23. 


24. 


25. 


26. 
27. 


28. 


29. 


30. 


Xo = (Xo, Yo, Zo) to X1 = (x1, y1, z1) depends only on 
lixol] and |x ll. 


If F is a continuous vector field everywhere tangent to 
an oriented C! curve C, then f, F + ds = fo ||F|| ds. 


If F is a constant vector field on R?, then fc F-ds=0, 
where C is any simple, closed curve. 


If F is an incompressible (i.e., divergenceless) C! vec- 
tor field on R? and C isa simple, closed curve, then 
the circulation of F along C is always zero. 


If F is an incompressible C 1 vector field on R, then 
the flux across any simple, closed curve C in R? is 
always zero. 


If C is a simple curve in R?, then To Vf-ds=0. 

If C is a simple, closed curve in R? and f is of class 
C', then fo Vf + ds = 0. 

F = (e* cos y + 3)i— e” sin yj is a conservative vec- 
tor field on R?. 


If f and g are functions of class C! defined on a region 
D in R?, then 


§ fVg -ds = $ gV f eds. 

aD aD 

If C is a closed curve in R? such that f.F+ds = 0, 
then F is conservative. 

foxdx+ydy+zdz=0 for all simple, closed 
curves C in R°. 


fc e*(cos y sinz dx + sin y sin z dy + cos ycosz)dz 
= 0 for all simple, closed curves C in R3. 

If V x F = 0, then F is conservative. 

Let M(x, y) and N(x, y) be C! functions with domain 
R? — {(0, 0)}. If 9M/dy = ON/dx, then fo M dx + 
N dy = 0 for any closed curve C in R°. 

Let M(x,y,z), N(x, y,z) and P(x, y,z) be C! 
functions with domain R? — {(0, 0, 0)}. If 3M/3y = 
ƏN/ðx, ƏM/ðz = ƏðP/ðx, and dIN/dz = d0P/dy, 
then fọ Mdx+ Ndy+Pdz=0 for any closed 
curve C in R°. 


If F: R” —> R”, then there is at most one function 
f:R — R such that V f = F. 


If F is a differentiable vector field and F = V x G, 
then V F = 0. 
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Miscellaneous Exercises for Chapter 6 


Let C CR" be a piecewise C! curve and f:X CR" SR 
a continuous function whose domain X includes C. Then we 
define the quantity 


Sofas _ Je fds 


[Flav = Jods length of C 


to be the average value of f along C. Exercises 1—5 concern 
the notion of average value along a curve. 


1. Explain why it makes sense to use the preceding inte- 
gral formula to represent the average value. (A careful 
explanation involves the use of Riemann sums.) 


2. Suppose that a thin wire is shaped as a helical curve 
parametrized by 


x(t) = (cost, sint, t), 0 <t < 3r. 


If f(x, y, z) = x? + y? + 2z? + 1 represents the tem- 
perature at points along the wire, find the average tem- 
perature. 


3. Find the average y-coordinate of points on the upper 


semicircle y = va? — x?. 


4. Find the average z-coordinate of points on the broken- 
line curve pictured in Figure 6.41. 


(2,0, 0) 


x (2, 1,0) 


Figure 6.41 The broken-line curve of 
Exercise 4. 


5. Find the average value of f(x, y, z) = z? + xe” on the 
curve C obtained by intersecting the (elliptic) cylinder 
x?/5 + y? = 1 by the plane z = 2y. 


6. A metal wire is bent in the shape of the semicircle 
x? +y? = 4, y > 0, lying in the xy-plane. Suppose 
that the mass density at each point (x, y, z) of the wire 
is (x, y, z)=3-— y. 

(a) Find the total mass of the wire. 
(b) Using formulas analogous to those in §5.6, we de- 
fine the (first) moments of the wire to be 


/ xd(x, y, z)ds, f yd(x, y, z)ds, 
c c 


/ zô(x, y, z)ds. 
C 


Using these definitions, find the coordinates of the 
center of mass of the wire. 


7. Suppose that a wire is bent in the shape of a quarter 
circle of radius a. Find the center of mass of the wire 
if the density at points on the wire vary as the square 
of the distance from the center of the wire. 


8. (a) Find the centroid of the helical wire x = 3 cost, 
y = 3sint, z = 4t, where 0 < t < 4x. (Hint: No 
calculation should be necessary.) 

(b) Find the center of mass of the same wire if the den- 
sity at each point of the wire is equal to the square 
of the point’s distance from the origin. 


Ifa thin wire is bent in the shape of a curve C in the xy-plane 
and has mass density at each point along the curve given by a 
continuous function ô(x, y), then we may define the moments 
of inertia of C about the x- and y-axes, respectively, by 


L= f ads p= f 28 y)as, 
C C 


and the corresponding radii of gyration of C as 


[T, i; 
Ta =N r ry == r 
M : M 


where M denotes the total mass M = Jo ô(x, y) ds of the wire. 
Additionally, the moment of inertia of C about the origin (or, 
equivalently, about the z-axis, if we think of the xy-plane as 
embedded in R) is 


t= | œ +a, y)as, 
E 


with corresponding radius of gyration r, = J1,/M. Exer- 
cises 9-13 concern moments of inertia of curves. 


9. (a) Consider the wire of Exercise 6 again. Find its mo- 
ment of inertia about the y-axis. 


(b) What is the radius of gyration r, for the wire about 
the z-axis? 


10. Find the moment of inertia Zy and the radius of gy- 
ration rx about the x-axis of a straight wire between 
(—2, 1) to (2, 3) whose density varies along the wire 
as d(x, y) = y. 


11. Find the moment of inertia 7, and the radius of gyra- 
tion rx about the x-axis of a wire shaped as the curve 
y = x? between (0,0) and (2, 4) and whose density 
varies as d(x, y) = x. 


12. (a) Suppose that a thin metal wire is bent into a 
curve C in R? and has mass density at each point 
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13. 


14. 


15. 
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(x, y, z) along the curve given by a continuous 
function ô(x, y, z). Give general formulas analo- 
gous to those in §5.6 for the moments of inertia of 
the wire about the three coordinate axes. 


(b) Find the moments of inertia about the coordinate 
axes of a homogeneous (i.e., constant density) 
wire shaped like the helix x = 3 cost, y = 3 sint, 
z = 4t, where 0 < t < 4x. What are the radii of 
gyration? 


Find the moment of inertia /, about the z-axis of a wire 
in the shape of the line segment between (—1, 1, 2) 
and (2, 2, 3) if the density along the segment varies as 
(x, y, z) = 14 22. What is r,? 


Let r = f(@) be the polar equation of a curve in the 
plane. 


(a) Use scalar line integrals to show that the arclength 
of the curve between (f(a), a) and (f(b), b) is 


b oo 
| VOGON + (FON d0. 


(b) Sketch the curve r = sin? (8/2) and find its length. 


(a) Give a formula in polar coordinates for the scalar 
line integral of a function g(x, y) along the curve 
r= f(0),a<6<bD. 

(b) Compute fo gds, where g(x, y) = x? + y? — 2x 
and C is the segment of the spiral r = e*’, 0 < 
0 <27. 


Let C be a piecewise C!, simple curve in R°. The total curva- 
ture K of C is 


K= f ds, 
G 


where x denotes the curvature of C. (See §3.2 to review the no- 
tion of curvature.) Exercises 16—20 involve the notion of total 
curvature. 


16. 


17. 


18. 


19. 


Show that if C is a simple curve of class C! 
parametrized by x(t), a < t < b, then 


b 
vxa 
=f Ix al 
a Ivl 


(Recall that v and a denote, respectively, the velocity 
and acceleration of the path x.) 


Find the total curvature of the helix 


x(t) = (3 cosż, 3 sinz, 4t), 0<tx<l0r. 


Find the total curvature of the parabola y = Ax?, 
A > 0, fora <x <b. 


Fenchel’s theorem states that if C is a simple, closed 
C! curve in R?, then K > 27 and, moreover, K = 27 
if and only if C is a plane convex curve. (A simple, 
closed curve C is convex if the line segment joining 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


any two points of C lies entirely in the region en- 
closed by C.) Verify Fenchel’s theorem for the ellipse 
x?’ /a + y’ /b =1. 


Let C be a simple, closed C! curve in R?. Suppose that 

the curvature « of C is bounded (i.e.,0 < x < 1/a for 

some a > 0). 

(a) Show that if L denotes the length of C, then 
L > 2ra. 


(b) What can you say about C if L = 27a? 


Calculate the work done by the vector field F = 
sin x i + cos y j + xz k on a particle moving along the 
path x(t) = (0, -a t), where 0 <ż <1. 


Use Green’s theorem to find the work done by the vec- 
tor field F = x? yi + (x + y)yj in moving a particle 
from the origin along the y-axis to the point (0, 1), 
then along the line segment from (0, 1) to (1, 0), and 
then from (1,0) back to the origin along the x-axis. 
(Warning: Be careful.) 


Use Green’s theorem to recover the formula 
1 f? 3 
A= 5 (f(@))° dé 
for the area A of the region D described in polar coor- 
dinates by 
D=({(r,0)|0<r< f(@),a <0 <b}. 


Let C be a piecewise C!, simple, closed curve in R?. 
Show that 


f EAT 0. 


where f and g are any single-variable functions of 
class C!. 


Let D be a region in R? whose boundary 3D consists 
of finitely many piecewise C!, simple, closed curves 
oriented so that D is on the left as you travel along any 
segment of dD. If (x, y) denotes the coordinates of the 
centroid of D, show that 


1 
x ef x? dy 
2-areaof D Jəp 


and 


1 
y = ——__ dy. 
Z area of D $» i 
Also show that 


1 
X = —— xydx 
area of D Jəp 
and 
3 1 2 
y= — y“ dx. 
i 2 -area of D Jap 


Use the results of Exercise 25 to find the centroid of the 
triangular region with vertices (0, 0), (1, 0), and (0, 2). 


27. Use the results of Exercise 25 to find the centroid of 
the region in R° that lies inside the circle of radius 6 
centered at the origin and outside the two circles of 
radius 1 centered at (4, 0) and (—2, 2), respectively. 


28. Let C be a piecewise C!, simple, closed curve, ori- 
ented counterclockwise, enclosing a region D in the 
plane. Let n be the outward unit normal vector to D. 
If f(x, y) and g(x, y) are functions of class C? on D, 
establish Green’s first identity: 


[|e +r- vda = f SVs nas. 


29. Under the hypotheses of Exercise 28, prove Green’s 
second identity: 


[[uve-evnaa = Ve -8Y f)-nds. 


A function g(x, y) is said to be harmonic at a point (xo, yo) if 
g is of class C? and satisfies Laplace's equation 
ag 8g 
Vee = —2 4+ —2 =0 

E= 9x2 3y? 
on some neighborhood of (xo, yo). We say that g is harmonic on 
a closed region D C R? if it is harmonic at all interior points 
of D (i.e., not necessarily along ð D). Exercises 30—33 concern 
some elementary results about harmonic functions in R?. 


30. Suppose that D is compact (i.e., closed and bounded) 
and that 3 D is piecewise C! and oriented as in Green’s 
theorem. Let n denote the outward unit normal vector 
to 0D and let dg/dn denote Vg » n. (The term ðg/ðn 
is called the normal derivative of g.) Use Green ’s first 
identity (see Exercise 28) with f(x, y) = 1 to show 
that, if g is harmonic on D, then 


31. Let f be harmonic on a region D that satisfies the 
assumptions of Exercise 30. Show that 


f| nintak tas. 


32. Suppose that f(x, y) = 0 for all (x, y) € 0D. Use Ex- 
ercise 31 to show that f(x, y) = 0 throughout all of 
D. (Hint: Consider the sign of Vf -V f.) 


33. Let D be a region that satisfies the assumptions of Ex- 
ercise 30. Use the result of Exercise 31 to show that if 
fı and fọ are harmonic on D and fi (x, y) = fo(x, y) 
on ð D, then, in fact, fı = fə on all of D. Thus, we see 
that harmonic functions are determined by their values 
on the boundary of a region. (Hint: Consider fı — f2.) 


34. We call a vector field F on R? radially symmetric if 
it can be written in spherical coordinates in the form 
F = f(p)e,, where e, is the unit vector that points in 
the direction of increasing p-coordinate. (See §1.7.) 
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(a) Give an example of a (nontrivial) radially sym- 
metric vector field, written in both Cartesian and 
spherical coordinates. 

(b) Show that if f is of class C! for all p > 0, then 
the radially symmetric vector field F = f(p)e, is 


conservative. 
35. Let F= DT, 
x2 + y2 


(a) Verify Green’s theorem over the annular region 
D= {[@,y)|a@ <x? +y? <1). 


(See Figure 6.42.) 


x? +y? =1 


Figure 6.42 The annular region 
of Exercise 35(a). 


(b) Now let D be the unit disk. Does the formula of 
Green’s theorem hold for D? Can you explain why? 


(c) Suppose C is any simple, closed curve lying out- 
side the circle 


Ca = {(x, y) | 4 +y _ a°}. 


y 


Figure 6.43 The curves C and C, of 
Exercise 35(c). 


(See Figure 6.43.) Argue that if C is oriented coun- 
terclockwise, then 


xdy — ydx 
—z z 52. 
c x+y 
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36. 


37. 


38. 
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y x 


Consider the vector field F = - i+ 
x2 an y? x2 + y 


zi 

(a) Calculate V x F. 

(b) Evaluate f, F -ds, where C is the unit circle 
er+y=1, 

(c) Is F conservative? 

(d) How can you reconcile parts (a) and (b) with 
Theorem 3.5? 


(a) Let F = e” i + x*j. Calculate the flux f, F -nds 
of F across the boundary of the rectangle R = 
[0, 1] x [0, 5]. 

(b) Let f and g be of class C! and let F= f(y)i+ 
g(x) j. Show that the flux of F across any piecewise 
C! simple, closed curve is zero. 


Use Newton’s second law of motion F = ma to show 
that the work done by a force field F in moving a par- 
ticle of mass m along a path x(t) from x(a) to x(b) is 


39. 


equal to the change in kinetic energy of the particle. In 
other words, 


fF -ds = tm(v(b)P — 4m(v(a)y’, 


x 


where v(t) = ||v(t)||, the speed at time t. (Use the prod- 
uct rule for dot products of vector-valued functions.) 


Let F be a conservative vector field on R? with F = 
—VV. If a particle travels along a path x, recall that 
its potential energy at time ż is defined to be V(x(r)). 
Use line integrals to prove the law of conservation of 
energy: As a particle of mass m moves between any 
two points A and B in a conservative force field, the 
sum of the potential and kinetic energies of the particle 
remains constant. (Use Exercise 38 and Theorem 3.3.) 
The use of line integrals provides an alternative proof 
of Theorem 4.2 in $4.4. 
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7.1 Parametrized Surfaces 


Introduction 


Surfaces in R? may be presented analytically in different ways. Here are two 
familiar descriptions: 


1. as a graph of a function of two variables, that is, as points (x, y, z) in R? 
satisfying z = f(x, y) (e.g., z = x? + 4y”); 

2. as a level set of a function of three variables, that is, as points (x, y, z) 
such that F(x, y, z) = c for some suitable function F and constant c (e.g., 
xy = zy +x = 1). 

Both of these descriptions are problematical. As noted in §2.1, many common 
surfaces cannot be described as graphs of functions of two variables. Recall, for 
example, that the full sphere x? + y? + z? = 1 is not the graph of a function of 
two variables. Therefore, description 1 is not sufficiently general. 

There are also problems with description 2. Not all equations of the form 
F(x, y, z) = c have solutions that fill out surfaces. Indeed, although the level set 
of F(x, y, z) = x? + y? + z? at height 1 is a sphere, at height 0 it is a single point, 
and at height — 1 completely empty. In addition, it is somewhat tricky to describe 
surfaces (i.e., two-dimensional objects) in R” by using level sets when n is larger 
than 3. Another approach is desirable for presenting surfaces analytically, in order 
to avoid the problems just mentioned, to emphasize clearly the two-dimensional 
nature of a surface and to facilitate subsequent calculations. With this discussion 
in mind, we state the following definition: 


DEFINITION 1.1 Let D be a region in R? that consists of a connected open 
set, possibly together with some or all ofits boundary points. A parametrized 
surface in R? is a continuous function X: D C R? > R? that is one-one on 
D, except possibly along ð D. We refer to the image X(D) as the underlying 
surface of X (or the surface parametrized by X) and denote it by S. (See 
Figure 7.1.) 


The restrictions on the region D and the map X of Definition 1.1 are meant to 
ensure that D is a two-dimensional subset of R? with a two-dimensional image. 
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t Z 


X(D)=S 


<< 


Figure 7.1 A parametrized surface. 


If we write the component functions of X, then, for (s, t) € D, 


X(s, t) = (x(s, t), y(s, t), z(s, £)), 


and the underlying surface S can be described by the parametric equations 


x = x(s,t) 
y=y(s,t) (s,t)e€ D. (1) 
z= 2(s,t) 


EXAMPLE 1 Consider the parametrized surface X: R? —> R? described by 
X(s, t) = si — j) + ti + 2k) + 3j. 


The image of X, shown in Figure 7.2, is the plane through the point (0, 3, 0), 
determined by the vectors a =i— j and b = i+ 2k. (See Proposition 5.1 
of §1.5.) + 


Figure 7.2 The parametrized plane of Example 1. 


EXAMPLE 2 Let D = [0, 27) x [0, 2] and consider X: D —> R? given by 
X(s, t) = (a coss sint, asins sint, a cost). 
The corresponding parametric equations are 


x =acoss sint 
y =asins sint O<s<2n,0<t<zZ. 
z =acost 


The parametric equations imply that x? + y? + z? = a?, meaning all the points 


of S = X(D) lie on a sphere of radius a centered at the origin. The paramet- 
ric equations are precisely the spherical-rectangular coordinate conversions (see 
§1.7) with the p-coordinate held constant at a and with s and f used instead of 6 
and gy. Hence, the image of X is indeed all of the sphere. (See Figure 7.3.) + 
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t Z 

Image of 
=0 
T A X 5 
l —_—— 
s=0 i 
— 
27 y 

x 


Figure 7.3 A sphere rendered as a parametrized surface. 


EXAMPLE 3 The points of the surface parametrized by 


x =acoss 
y=asins O<s<2n 
z=t 


satisfy the equation x? + y? = a? and so can be seen to form an infinite cylinder 
of radius a. Figure 7.4 shows how the function X(s, t) = (a cos s, a sin s, t) maps 
the infinite strip D = {(s, t) | 0 < s < 2x} onto the cylinder by “gluing” the line 
s=Otos =2z. + 


Z 
a i 
l 
i 
A 4 J 


20 


€ 


-2 


Figure 7.4 The map X glues together the edges of D to form 
a cylinder. 


EXAMPLE 4 Let D C R? be an open set, possibly together with some or all of 
its boundary points. If f: D — R is a continuous scalar-valued function of two 
variables, then it is not difficult to parametrize the graph of f: We let 


X:D—>R*> be X(s,t)=(s,t, f(s, t). 


That is, the parametric equations 


X=S 
y=t (s,t)e D 
z= f(s,t) 
describe the points of the graph of f. (See Figure 7.5.) + 


Coordinate Curves, Normal Vectors, 
and Tangent Planes 


Let S be a surface parametrized by X: D —> R?. If we fix t = tọ and let only s 
vary, we obtain a continuous map 


s — X(s, to), 
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X(D) is graph of z = f(x, y) 


Figure 7.5 The graph of z = f(x, y) as a parametrized surface. 


whose image is a curve lying in S. We call this curve the s-coordinate curve at 
t = tọ. Similarly, we may fix s = so and obtain a map 


t —> X(so, t), 


whose image is the t-coordinate curve at s = so. Figure 7.6 suggests the appear- 
ance of the coordinate curves. 


jae x X(s, to) 
aD- 
=l 


Figure 7.6 The coordinate curves of a parametrized surface. 


EXAMPLE 5 The points of the parametrized surface T defined by 


x = (a + b cost) coss 
y = (a + b cost)sins 
z = bsint 


O<s <27,0<t<27; 
a, b positive constants with a > b, 


satisfy the equation 


(VEF a) +2 =b. 


The s-coordinate curve at t = 0 is 


(a+ b) coss 
(a + b)sins 
0 


Ne & 
II 


II 


and is readily seen to be the circle of radius a + b, centered at the origin and lying 
in the xy-plane. In general, you may check that the s-coordinate curve at t = to 
is a circle of radius a + b cos tọ (which varies between a — b and a + b) in the 
horizontal plane z = b sin tọ. (See Figure 7.7.) 
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i Z 
2m 
m X s-coordinate 
curve at t = 7/2 s-coordinate 
n/2 \ curve at t=0 
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20 y 
s-coordinate 
x curve att=7 


Figure 7.7 Some s-coordinate curves for the torus T of Example 5. 


The t-coordinate curve at s = 0 is 


x =a +bcost 
y=0 
z = bsint 


The image is a circle of radius b centered at (a, 0, 0) in the xz-plane (i.e., the 
plane y = 0). At s = sọ, the t-coordinate curve is 


x = cos sọ (a + b cost) 
y = sin so (a + b cost) . 
z = bsint 


Along this curve, we have y/x = tan sọ, a constant. The curve lies in the vertical 
plane (sin so)x — (cos so)y = 0. Moreover, it is not hard to see that the distance 
from any point on this curve to the point P(a cos so, a sin sọ, 0) is b, and the 
image is a circle of radius b centered at P. See Figure 7.8 for examples of 
t-coordinate curves. 


t 


S 


m2 m 3n/2 2r 


Figure 7.8 Some t-coordinate curves for the torus T of Example 5. 


The aforementioned surface T is called a torus, a doughnut-shaped surface 
shown in Figure 7.9. It is generated both by the collection of the s-coordinate 
curves and by the collection of the t-coordinate curves. + 


Suppose that X(s, t) = (x(s, t), y(s, t), z(s, t)), where (s, t) € D, is a dif- 
ferentiable (or C!) map, in which case we say that the parametrized surface 
S = X(D) is differentiable (or C!) as well. Then the coordinate curves X(s, fo) 
and X(so,t) have well-defined tangent vectors at points (so, to) in D. (See 
Figure 7.10.) To find the tangent vector T, to the s-coordinate curve X(s, fo) 
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X 
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So È 


Figure 7.9 The parametrized torus T. 


s-coordinate 
curve at t= to 


t-coordinate 
curve at $ = So 


} S 
So 


x 


Figure 7.10 The tangent vectors T, and T, to the coordinate curves. 


at (so, to), we differentiate the component functions of X with respect to s and 
evaluate at (so, to): 


ox Ox s. 10 > OZ 
T, (so, to) = —(So, fo) = —(So, to) i + LN to)j + (So, to) K. (2) 
os ORY RY Os 


Similarly, the tangent vector T, to the t-coordinate curve X(sọ, t) at (so, to) may 
be calculated by differentiating with respect to t: 


ox Ox 0 at OZ 
T; (so, fo) = 5; bso» to) = zC” to)i + =o to)j + 3, C” to) k. (3) 


Since T, and T, are both tangent to the surface S at (so, to), the cross product 
T; (so, to) x T;(s0, to) will be normal to S at (so, fo), provided it is nonzero. 


DEFINITION 1.2 The parametrized surface S = X(D) is smooth at 
X(so, to) if the map X is of class C! in a neighborhood of (so, to) and if 
the vector 


N(so, to) = Ts(so, to) x T: (So, to) Æ 0. 


If S is smooth at every point X(so, fo) € S, then we simply refer to S as a 
smooth parametrized surface. If S is a smooth parametrized surface, we call 
the (nonzero) vector N = T, x T, the standard normal vector arising from 
the parametrization X. 
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EXAMPLE 6 We claim that the torus T of Example 5 is smooth. Recall that T 
is given as the image of the map 


X: [0, 277] x [0, 277] > Rè, 
X(s, t) = ((a + b cost) coss, (a + bcost)sins, b sint), 
where a > b > 0. Then from formulas (2) and (3), we have 
T,(So, to.) = —(a + b cos to) sin so i + (a + b cos to) cos so j 
and 
T; (so, to) = —b sin to cos sọ i — b sin to sin sọ j + b cos to k, 
so that 
T, x T, = (a + b cos to)b cos to cos so i + (a + b cos ty) cos to sin so j 


+ (a + b cos to)b sin tọ k 


= b(a + b cos to )(cos to cos sọ i+ cos tọ sin sọ j + sin tọ K). 


Since a > b > 0, the factor b(a + b cos tọ) is never zero. Furthermore, since the 
sine and cosine functions are never simultaneously zero, at least one component 
of T, x T, is never zero. Hence, the torus is a smooth parametrized surface. ® 


s=-—2 t gs=2 g 


s-coordinate curve 
at t= 7/4 


t=n/4 coordinate ee 
curve at s=2 \Y 


y 
t-coordinate 


\ curve at s =—2 


Figure 7.11 The cone z? = x? + y° as a parametrized surface. 


EXAMPLE 7 The equation z? = x? + y? defines a cone in RÌ. (See Figure 
7.11.) If z is held constant (which corresponds to slicing the surface by a 
horizontal plane), then the expression x? + y? is constant. Hence, the constant-z 
cross sections are circles of radius |z| or a single point in the case of the vertex. 
This suggests that we can parametrize the cone by using one parameter variable 
for z and another for the angle around the z-axis. Thus, we obtain the following 
equations: 


x = s cost 
y=ssint 0<t<2z. 
Z=S 


Then we have 


T, =costi+sintj+k and T, =—ssinti+scostj. 
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Therefore, 
i j k | 
T,xT,= cost sint 1 | = —s costi — s sint j+ sk. 
—ssint scost 0 
Note that T, x T, = 0 when (and only when) s = 0. The cone fails to be smooth 


just at its vertex (the single point of the underlying surface corresponding to 
s=0). + 


Examples 6 and 7 suggest why the terminology “smooth” is used: Intuitively, 
a parametrized surface is smooth at a point if it has no sharp “cusps” or “corners” 
there. This is true of the torus but not of the cone, which has a singularity at its 
vertex. 

If a parametrized surface is smooth at a point X(so, to), then we define the 
tangent plane to S$ = X(D) at the point X(so, to) to be the plane that passes 
through X(so, to) and has 


N(50, to) = Ts(80, to) x Ti(so, to) 


as normal vector. To write an equation for this plane, we denote (x, y, z) by the 
(variable) vector x. Then the tangent plane equation is 


N(50, to) * (X — X(50, to)) = 0. (4) 


If we write the components of N(so, tọ) as Ai+ Bj+Ck and X(so, to) as 
(x(So, to), y(So, to), Z(So, to)), then we may expand equation (4) to obtain 


A(x — x(So, to)) + B(y — y(So, to)) + C(z — z(so, to)) = 0. (5) 


EXAMPLE 8 Consider once again the parametrized cone of Example 7 as 
X(s, t) = (s cost, s sint, s). 


From the calculations in Example 7, the cone is smooth at the point (0, 1, 1) = 
X(1, 7/2), and so the tangent plane exists at that point. We have 


T , T , 
T. (1.5) =J+k and T, (5) =-i, 
so that 


7 . i k 
N=T, (1,5) xT, (1,7) = 0 1l 1ļ=-j+k. 
2 2 £ F 


Hence, equation (5) can be applied to verify that an equation for the tangent 
plane is 


Ox —0)- ly-D+hz-D=0 
or, more simply, 


Z=y. + 


EXAMPLE 9 If f(x, y) is of class C! in a neighborhood of a point (xo, yo) 
in its domain D, then the graph of f is a smooth parametrized surface at 
(xo, yo, f (Xo, yo)). Recall from Example 4 that the graph of f is parametrized by 


Figure 7.12 A cube. 
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X: D > R?, X(s, t) = (s, t, f(s, t)). Then 


of of 
T, =i k d T =j k, 
ur an t its, 
so that 
f. of, 
=T, xT, =-—i- — k 
pi eee ae 


Note that N is nonzero at any point (s, t, f(s, t)) = (x, y, f(x, y)). 
Next, consider the surface defined as the level set 


S = {(x, y, zZ) | F(x, y, z) = c, c constant}. 


If F is of class C! in a neighborhood of a point (xo, yo, Zo) € S and VF (xo, Yo, 
Zo) Æ 0, then the implicit function theorem (Theorem 6.5 of §2.6) implies that, in 
principle at least, the defining equation F(x, y, z) = c of S always can be solved 
locally for (at least) one of the variables x, y, or z in terms of the other two. 
In other words, under the given assumptions on F, the level set S is locally the 
graph of a C! function of two variables. It is important to remember that the 
idea of “solving locally” does not mean that all points of S can be described as 
the graph of a single function (as we already know quite well in the case of the 
sphere x? + y? + z? = 1, forexample), butrather that, near points (xo, yo, zo) € S 
where V F (xo, yo, zo) Æ 9, a portion of S may be described as a graph. Hence, 
graphs of C! functions of two variables and level sets of C! functions of three 
variables, under certain smoothness hypotheses, are locally equivalent descrip- 
tions for surfaces. Moreover, since the graph of a C! function is a smooth 
parametrized surface, we may shift relatively freely among our three descriptions 
for surfaces. + 


Smooth parametrized surfaces are of primary importance because of the ease 
with which we may adapt techniques of calculus (particularly integral calculus) 
to them. But we are also interested in piecewise smooth parametrized surfaces. 


DEFINITION 1.3 A piecewise smooth parametrized surface is the union of 
images of finitely many parametrized surfaces X;: D; > Roce M. 
where 

* Each D; is a region in R? consisting of a connected open set, possibly 


together with some of its boundary points (for the most part, we want D; 
to be an elementary region); 

* Each X; is of class C! and one-one on all of D;, except possibly along 
dD;; 

° Each S; = X;(D;) is smooth, except possibly at finitely many points. 


EXAMPLE 10 The surface of a cube is a piecewise smooth parametrized 
surface. It is the union of its six faces, each one of which is a smooth parametrized 
surface, namely, a portion of a plane. 

More explicitly, suppose that a cube’s faces are portions of the planes x = 0, 
x = l,y = 0,y =1,z =0,andz = | asin Figure 7.12. Then we may parametrize 
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the cube’s faces by X;:[0, 1] x [0, 1] > R?,i = 1,..., 6, where 
Xi (s, t) = (0, s, t); Xalis t) = (1, s, 1); X3(s5, t) = (s, 0, t); 
X4(s, t) = (s, 1,0); X5(s, t) = (s, t, 0); Xo(s, t) = (s, t, 1). 


Each map X; is clearly of class C! and one-one. In addition, the faces have 
well-defined nonzero normal vectors. For example, for both X; and Xp, 


N,; =N2.=T, xT; =jxk=i. 
Similarly, 
N; = N4 =ixk= -j and Ns =No=ixj=k. 


None of these vectors vanishes. There is no consistent way to define normal 
vectors along the edges of the cube (where two faces meet). That is why the cube 
is only piecewise smooth. + 


Area of a Smooth Parametrized Surface 


Now, we use the notion of a parametrized surface to calculate the surface area of a 
smooth surface. In the discussion that follows, we take S = X(D) to be a smooth 
parametrized surface, where D is the union of finitely many elementary regions 
in R? and X: D — R? is of class C! and one-one except possibly along 0D. 


t zZ 


S=X(D) 


AA 
(50, to) 


x 


Figure 7.13 The image of the As x At rectangle in D is approximately a 
parallelogram spanned by T,(so, to) As and T,(so, fo) At. 


The key geometric observation is as follows: Consider a small rectangular 
subset of D whose lower left corner is at the point (so, to) € D and whose width 
and height are As and At, respectively. The image of this rectangle under X is a 
piece of the underlying surface S that is approximately the parallelogram with a 
corner at X(so, to) and spanned by the vectors T;(so, to) As and T;(so, t9) At. (See 
Figure 7.13.) The area AA of this piece is 


AA X ||Ts(80, to AS x T,(So, to) Atl] = IITs (0, to) x Tr(S0, to) || As At. 


Now, suppose D = [a, b] x [c, d]; that is, suppose D itself is a rectangle. 
Partition D into n? subrectangles via 


a = s0 <sj<-::<S,=b and c=) <t <.: <t =d. 


Let As; = si — si-1 and At; = t; —t;-; fori, j = 1,..., n. Then S is in turn 
partitioned into pieces, each of which is approximately a parallelogram, assuming 
As; and At; are small fori, j = 1,...,n. If AAj;; denotes the area of the piece 
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of S that is the image of the ijth subrectangle of D, then 


Surface area of $ = > AAij 


i,j=l 


x > IIT s@i-1, tj-1) x Tr(si-1, tji) As; At;. 


i j=l 


Therefore, it makes sense to define 


n d b 
Surface area of S = lim o 5 AAjj = / / IT; x T;|| ds dt 
Zí ij=l C a 


all As;, At; 


and, in general, where D is an arbitrary region (i.e., not necessarily a rectangle), 


Surface area of S = I IT, x T,|| ds dt. 
D 


Formula (6) can be extended readily to the case where S is a piecewise smooth 
parametrized surface by breaking up the integral in an appropriate manner. 


EXAMPLE 11 We use formula (6) to calculate the surface area of a sphere of 
radius a. Recall from Example 2 that the map 
X(s, t) = (acosssint,asinssint,acost), O<s<2z, O<t<az 


parametrizes the sphere in a one-one fashion. Then 


T, = —a sins sinti+acoss sintj 
and 
T; = acoss costi+asins costj—asintk. 
Hence, 
T, x T, = —a’ sint (coss sinti+sins sint j+ cost k), 
so that 


IT, x T,|| = a’ sint. 


Therefore, formula (6) becomes 
T 20 m 
Surface area = / J a’ sint ds dt = / 27ra’ sint dt 
o Jo 0 


= 2ra’ (— cost); = 2ra?’ (1 + 1) = 4ra’. 


This result checks with the well-known formula for the surface area of a sphere. 
Note, however, that if we let 0 < s < 47,0 < t < x, then the image of X is the 
same sphere, but formula (6) would yield 


m Ar 
| / a’ sint ds dt = 87ra’. 
o Jo 


This “overcount” is why we must assume that the map X is (nearly) one-one. 
(Note: The aforementioned parametrization fails to be smooth at t = 0 and at 
t = x, but this is along 0D and so does not affect the surface area integral.) @ 
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If we write the component functions of X as 
X(s, t) = (x(s, t), y(s, t), z(s, £)), 


we find that 
i j k 
ð a) a 
Tx =| 2 Z 
Os Os Os 
Ox dy Oz 
ot ot ot 


II 


ðy əz ððyəðz\. əx əz Oxd0z\, ðxəðy Ox doy k 
i4 H 
ðs ðr dt ds arðs asar)! \ds ar at ds 
Using the notation of the Jacobian, we obtain 


ay, a(x, a(x, 
Neat ee i L E (7) 
a(s, t) a(s, t) a(s, t) 
This alternative formula for the normal vector to a smooth parametrized surface 
will prove useful to us on occasion. For the moment, we take its magnitude: 


e a (8A. DY 
meni = f( cn) (Gea) (Gea) 


Hence, formula (6) may also be written as 


Surface area of S 


a(x, 2 —(I@DY E (80,0) 
-ffl ka 2) i (2) (2) U 


EXAMPLE 12 Findthe surface area of the torus described in Example 5. Recall 
that the torus is parametrized as 


x = (a + b cost) cos s 
y = (a + b cost) sins 0<s,t<2m, a>b>0Q. 


z=bsint 

Thus, 
d(x, y) _ | —(a+bcost)sins —bsint coss 
a(s,t) (a+bcost)coss —b sint sins 


= (a + bcost)(b sint sin? s + b sint cos? s) 


= (a + bcost)bsint, 


d(x,z) _ | —(a+bcost)sins —bsint coss 
a(s,t) 0 bcost 


= —(a + bcost)b cost sins, 
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and 


aye) 
a(s, t) E 


(a+bcost)coss —bsint sins 
0 b cost 


= (a + b cost)b cost coss. 
By formula (8), we have 


Surface area 


27 2T 
= / f Ja + b cos t)2[b2} sin? t + b? cos? t sin? s + b? cos? t cos? s] ds dt. 
o Jo 


Using the trigonometric identity cos? 0 + sin? @ = 1 twice, we simplify the inte- 
gral to 


27 27 27 
/ (a+bcost)bds dt = | 2rb(a + bcost) dt 
0 Jo 0 


= 2nb(at + bsint)|.” 
= Arab. + 


EXAMPLE 13 Suppose that a smooth surface is described as the graph ofa C! 
function f(x, y), thatis, by the equation z = f(x, y), where (x, y) varies through a 
plane region D. Then the standard parametrization X(s, t) = (s, t, f(s, t)) implies 


T, xT, =—-fpi- fj+k. 
(See Example 9.) Formula (6) yields 


Surface area = |f IT, x T;|| asat= | | VJ f2t+ ff? +1dsdt. 
D D 


Since x = s, y = t in this parametrization of the graph, we conclude that 


Surface area of the graph of f(x, y) over D 


= | | [RFR 1axay 


+ 


One final note: It is not at all clear that either formula (6) or formula (8) 
depends only on the underlying surface S = X(D) and not on the particular 
parametrization X. These formulas are independent of the parametrization, as we 
shall observe in the following section, in the context of general surface integrals. 


1. Let X: R? — R? be the parametrized surface given by (b) Find an equation for the plane tangent to this surface 


X(s,t) = ($? —t?,s +t, s? +31). 


(a) Determine a normal vector to this surface at the 


point 


at the point (3, 1, 1). 


2. Find an equation for the plane tangent to the torus 
X(s, t) = ((5 + 2cost)coss, (5 + 2 cost) sins, 2 sint) 


(3, 1,1) = X(2, —1). at the point (6 — /3)/V2, (5 — V3)/V2, 1). 
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. Find an equation for the plane tangent to the surface 


x=e, y=Prer, z=2e*+t 


at the point (1, 4, 0). 


. Let X(s, t) = (s? cost, s? sint, s), —3 < s <3, 0 < 


t < 27. 
(a) Find a normal vector at (s, t) = (—1, 0). 
(b) Determine the tangent plane at the point (1, 0, —1). 


(c) Find an equation for the image of X in the form 
F(x, y,z) = 0. 


. Consider the parametrized surface X(s, t) = (s,s? + 


t, t°). 

(a) Graph this surface for —2 < s < 2, —2 < t <2. 
(Using a computer may help.) 

(b) Is the surface smooth? 


(c) Find an equation for the tangent plane at the point 
(1,0,1). 


. Describe the parametrized surface of Exercise 1 by an 


equation of the form z = f(x, y). 


. Let S be the surface parametrized by x = s cost, 


y = s sint, z = s?, wheres > 0,0 < t < 27. 
(a) At what points is S smooth? Find an equation for 
the tangent plane at the point (1, v3, 4). 


(b) Sketch the graph of S. Can you recognize S as a 
familiar surface? 


(c) Describe S by an equation of the form z = f(x, y). 


(d) Using your answer in part (c), discuss whether S 
has a tangent plane at every point. 


. Verify that the image of the parametrized surface 


X(s, t) = (2sins cost, 3 sins sint, coss), 


O<s<z, 0<t<2xr, 


is an ellipsoid. 


Verify that, for the torus of Example 5, the s-coordinate 
curve, when ft = tọ, is a circle of radius a + b cos tọ. 


The surface in R? parametrized by 
X(r,0)= (r cos8,r sin, 0), r> 0, —œ <0 <œ, 


is called a helicoid. 

(a) Describe the r-coordinate curve when 0 = 7/3. 
Give a general description of the r-coordinate 
curves. 


(b) Describe the -coordinate curve when r = 1. 
Give a general description of the 6-coordinate 
curves. 

(c) Sketch the graph of the helicoid (perhaps using 
a computer) for 0 < r < 1,0 < 0 < 4m. Can you 
see why the surface is called a helicoid? 


11. 


Given the sphere of radius 2 centered at (2, —1, 0), 
find an equation for the plane tangent to it at the point 
(1, 0, /2) in three ways: 


(a) by considering the sphere as the graph of the 


function 
fe. y) = V4- œ = 2? -O + Ds 
(b) by considering the sphere as a level surface of the 
function 


F(x,y, z) = œ% -27 +0 +17 +2; 


(c) by considering the sphere as the surface para- 
metrized by 


X(s, t) = (2sins cost + 2,2sins sint — 1, 2 cos s). 


In Exercises 12-15, represent the given surface as a piecewise 
smooth parametrized surface. 


12. 


13. 


14. 


15. 


16. 


The lower hemisphere x? + y? + z? = 9, including the 
equatorial circle. 


The part of the cylinder x? + z? = 4 lying between 

y=—landy=3. 

The closed triangular region in R? with vertices 

(2, 0, 0), (0, 1, 0), and (0, 0, 5). 

The hyperboloid z? — x? — y? = 1. (Hint: Use two 

maps to parametrize the surface.) 

This problem concerns the parametrized surface 

X(s, t) = (87, ©, st). 

(a) Find an equation ofthe plane tangent to this surface 
at the point (1, —1, —1). 

(b) Is this surface smooth? Why or why not? 


<p (c) Use a computer to graph this surface for —1 < s < 


TAC) 


17. 


1l,-l<t<l. 

Verify that this surface may also be described by 
the xyz-coordinate equation z = ¿/xy. Try using 
a computer to graph the surface when described in 
this form. Many software systems will have trou- 
ble, or will provide an incomplete graph, which is 
one reason why parametric descriptions of surfaces 
are desirable. 


The surface given parametrically by X(s,t)= 


(st, t, s?) is known as the Whitney umbrella. 
(a) Verify that this surface may also be described by 
the xyz-coordinate equation y*z = x’. 


(b) Is X smooth? 


È (c) Use a computer to graph this surface for —2 < s < 


2,—2<t <2. 

(d) Some points (x, y, z) of the surface do not cor- 
respond to a single parameter point (s, t). Which 
ones? Explain how this relates to the graph. 


(e) Give an equation of the plane tangent to this sur- 
face at the point (2, 1, 4). 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


(f) Show that at the point (0, 0, 1) on the image of X 
it’s reasonable to conclude that there are two tan- 
gent planes. Give equations for them. 


Let S be the surface defined as the graph of a function 
f(x, y) of class C!. Then Example 4 shows that S is 
also a parametrized surface. Show that formula (5) for 
the tangent plane to S at (a, b, f(a, b)) agrees with that 
of formula (4) in §2.3. 


(a) Write a formula for the tangent plane to a surface 
described by the equation y = g(x, z). 

(b) Repeat part (a) for a surface described by the equa- 
tion x = h(y, z). 


Suppose X: D —> R? is a parametrized surface that is 
smooth at X(so, to). Show how the definition of the 
derivative DX(so, to) (see Definition 3.8 of Chapter 2) 
can be used to give vector parametric equations for the 
plane tangent to S = X(D) at the point X(so, to). 


Use the result of Exercise 20 to provide parametric 
equations for the plane tangent to the surface X(s, t) = 
(s, s? + t, t°) at the point (1, 0, 1). Verify that your an- 
swer is consistent with that of Exercise 5(c). 


Use the parametrization in Example 3 to verify that the 
surface area of a cylinder of radius a and height h is 
2mah. 


Let D denote the unit disk in the st-plane. Let X: D > 
R? be defined by (s + t, s — t, s). Find the surface area 
of X(D). 
Find the surface area of the helicoid 

X:D > R?, X(r,0) =(rcosd,r sind, 6) 
for 0<r <1, 0 <6 <2zn, where n is a positive 
integer. 


A cylindrical hole of radius b is bored through a ball 
of radius a (> b) to form a ring. Find the outer surface 
area of the ring. 


Find the area of the portion of the paraboloid z = 
9 — x? — y? that lies over the xy-plane. 


Find the area of the surface cut from the paraboloid 
z = 2x? + 2y? by the planes z = 2 and z = 8. 
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28. Calculate the surface area of the portion of the plane 


29. 


30. 


31. 


32. 


33. 


x + y + z =a cutout by the cylinder x? + y? = a’ in 
two ways: 

(a) by using formula (6); 

(b) by using formula (9). 


Let S be the surface defined by the equation z = 
f(x,y). If f? + f? =a, where a is a positive con- 
stant, determine the surface area of the portion of S 
that lies over a region D in the x y-plane in terms of the 
area of D. 


Let S be the surface defined by 


forz > 1. 


_ 1 
Ohya y? 
(a) Sketch the graph of this surface. 


(b) Show that the volume of the region bounded by S 
and the plane z = 1 is finite. (You will need to use 
an improper integral.) 

(c) Show that the surface area of S is infinite. 


Find the surface area of the intersection of the cylinders 
x? + y? =a’? and y? + z? =a’. 


Suppose that a surface is given in cylindrical coordi- 
nates by the equation z = f(r, 0), where (r, 0) varies 
through a region D in the r@-plane where r is non- 
negative. Show that the surface area of the surface is 
given by 


af? 1 (ary 
1 — =| = dr dé. 
ff +(¥) Ta (sr) ad 
Suppose that a surface is given in spherical coordi- 
nates by the equation p = f(g, 0), where (p, 0) varies 
through a region D in the g@-plane and f(g, @) is non- 


negative. Show that the surface area of the surface is 
given by 


J| to 


x y (F0, 0? + fol, 8?) sin? p + folo, 8 do dd. 


7.2 Surface Integrals 


In this section, we will learn how to integrate both scalar-valued functions and 
vector fields along surfaces in R*. We proceed in a manner that is largely analo- 
gous to our explorations of line integrals in $6.1: We begin by defining suitable 
integrals over parametrized surfaces and then establish that the particular choice of 
parametrization doesn’t much matter—that, really, only the underlying surface is 
important, and possibly the orientation. 
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Area AA, 


ei; 


Ck 


Figure 7.14 A small piece 
of the surface S has area 
AAx. The point c is located 
in this surface piece. 


Scalar Surface Integrals 


Suppose S is a bounded surface in R? and f(x, y, z) isa continuous, scalar-valued 
function whose domain includes S. Then we want the surface integral ff s f dS to 
be a limit of some kind of Riemann sum. So suppose S is partitioned into finitely 
many small pieces and that the area of the kth piece is AA,;. Let ce, denote an 
arbitrary “test point” in the kth piece. (See Figure 7.14.) Then the surface integral 
of f over S should be 


| [ fas= lim Z fe)aAn (1) 


provided, of course, that this limit exists. 

Now, we add some formalism to provide a proper definition. Suppose that 
S is a smooth parametrized surface; that is, suppose that S is the image of the 
C! map X: D — R?, where D is a connected, bounded region in R*. Let f bea 
continuous function defined on S = X(D). As seen in §7.1, the small rectangle 
in D, having dimensions As and At with lower left corner at the point (so, fo), is 
mapped by X to a piece of surface that is approximately a parallelogram of area 


AA X ||T; (so, to) x T:(so, to) || As At. 
(See Figure 7.13, page 464.) Suppose D is the rectangle [a, b] x [c, d] and that 
we partition D by 
ad=S <8, < <S =b and c= <t <: <tr =d. 
Then the limiting sum that formula (1) represents is 


$ * * 
m mee FX, HY Tia tj-1) xX Telsizi 47-1 As: Atj, 2) 


where 
x * 
As; = Si — Sj-1, At; = tj —tj-1, Si-1 SS; < Si, and tj- St; < tj. 


(Thus, X(s;", t7) is an arbitrary point in the image of the subrectangle [s;—1, 5;] x 
[tj-1, tj] and, hence, a “test point” in the corresponding small surface piece.) But 


then the limit in formula (2) is 


d b 
J J FXG, ODT, x Tr ds dt. 


When D is a more arbitrary region than a rectangle, it makes sense to use the 
following definition for the surface integral of a function over a parametrized 
surface: 


DEFINITION 2.1 LetX: D —> R? be a smooth parametrized surface, where 
D C R? is a bounded region. Let f be a continuous function whose domain 
includes $ = X(D). Then the scalar surface integral of f along X, denoted 


STS 
[fras= fff f(X(s,1)) ITs x Til] ds dt 


= FXG, t) ING, Dll ds dt. 
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Although we need not assume that the map X is one-one on D in order to 
work with the integral in Definition 2.1, in practice we usually find it useful to 
take X to be one-one, except perhaps along 0D. If this is the case, and if f is 
identically 1 on all of X(D), then 


Tf sas=Jf 1as= ff IT, x T,|| ds dt = surface area of X(D), 
x x D 


as stated by formula (6) in §7.1. The scalar surface integral in Definition 2.1 is thus 
a generalization of the integral we use to calculate surface area. We can think of 
ffy f dS as the limit of a “weighted sum” of surface area pieces, the weightings 
given by f.If f represents mass or electrical charge density, then ffy f dS yields 
the total mass or total charge on X(D) (assuming X is one-one, except perhaps 
along ð D). 

For computational purposes, recall that if we write the components of X as 


X(s, t) = (x(s, t), y(s, t), z($, t)), 
then 


A(y,z), (x, 2), _ 3x, y) 


MeS LaS n Pa eA 


We obtain 


Jfras=]f f(x(s,t), y(s,t), z(s, 0) 


a(x, y) > a(x, Zz) z a(y, z) 2 
TET H (2) | (>) dsdt. (3) 


EXAMPLE 1 We evaluate ffy z> dS, where X: [0, 27] x [0, 2] > R? is the 
parametrized sphere of radius a: 


X(s, t) = (a coss sint,asins sint, a cost). 


Using Definition 2.1 or its reformulation in formula (3), we find that 
Ne, oj = ELY LEY 4 eV 
? as, D) aED) | \as,n 


= Jat(sin? t cos? t + sin? s sinf t + cos? s sin‘ t) 


= a? V sin? t cos? t + sinf t 


= a? jsir? t (cos? t + sin? t) 
=a’ sint. 
(See also Example 11 in §7.1.) Hence, 


T 2r T 
Tf dS =) (a cost)a’ sint ds dt Sa) 2x cos’ t sint dt 
x o Jo 0 


= 2ra (—}oos41)|5 = 2ra" (-4 — (—4)) =0. : 
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z 
S3:z=15 


Sx? +y =9 


Figure 7.15 The closed 
cylinder of radius 3 and 
height 15 of Example 2. 


To define and evaluate scalar surface integrals over piecewise smooth para- 
metrized surfaces, simply calculate the surface integral over each smooth piece 
and add the results. 


EXAMPLE 2 Let S be the closed cylinder of radius 3 with axis along the z-axis, 
top face at z = 15, and bottom face at z = 0, as shown in Figure 7.15. Then S$ 
is a piecewise smooth surface; it is the union of the three smooth parametrized 
surfaces $1, S2, and S3 described next. We calculate f f gzd8. 

The three smooth pieces may be parametrized as follows: 


x =3coss 
Sı (lateral cylindrical surface): y=3sins O<s<2n, 0<1t< 15, 
Tat 
X = S COS Í 
S2 (bottom disk): y=ssint 0<s<3, O0<t<027, 
z=0 
and 
x =scost 
S3 (top disk): y=ssint O<s <3, 0<t<2r. 
z= 15 


Using Definition 2.1, we have 


15 2n 
i cas = | | t ||(—3 sin s i + 3 cos s j) x k|| ds dt 
Si o Jo 


15 2x 
=) | t ||3 sin s j + 3 cos s il] ds dt 
o Jo 


15 p2r 15 15 
= | srdsat = | 6nt dt = 3nt?|, = 675x. 
0 0 0 


Now, f f s Z dS = 0, since z vanishes along the bottom of S. For $3, we have 


Tf zas= | | isas=15- | f lds 
S3 S3 S3 


= 15 . area of disk = 15-(97) = 135x. 


Jfas= f| zas+ ff zas+ ff zas 


= 675m + 0 + 1357r = 8107. , 


Therefore, 


If a surface S is given by the graph of z = g(x, y), where g is of class C! on 
some region D in R?, then S is parametrized by X(x, y) = (x, y, g(x, y)) with 
(x, y) € D. (See Example 4 of §7.1.) Then, from Example 13 in §7.1, 


N(x, y)= —g,i- g&yj +k, 


Jf tas= ff 1y ee Dte axa. (4) 


so that 
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EXAMPLE 3 Suppose S is the graph of the portion of the paraboloid z = 
4 — x? — y?, where (x, y) varies throughout the disk 
D={(x,y)€R? | x? +y? <4}. 


(See Figure 7.16.) Formula (4) makes it straightforward, although rather involved, 
to calculate 


ffo —z)dS, where X(x, y) = (x, y, 4 — x? — y?). 
x 


x 


Figure 7.16 The graph In particular, we have 
of 4 — x* — y? over the 


disk D of radius 2. [[e-aas= [| (4—(4—x? — y?)) V4x? + 4y? + ldx dy 
x D 
=f (x? + y?)/4x2 + 4y? + 1 dx dy. 
D 


To integrate, we switch to polar coordinates; that is, we let x = r cos@ and y = 
rsin@, where 0 <r < 2,0 < 0 < 2x. The desired integral becomes 


2n p2 2 pda 
i | Vari rarae = | J r°y4r? + 1d6 dr 
0 0 0 J0 
2 
= ax f r/4r? + 1 dr, 
0 


by Fubini’s theorem. Now let 2r = tanu; that is, let r = 5 tanu so that dr = 
5 sec? u du. The previous integral transforms into 


tan! 4 
1 1 
ax f g tam we vtan?u + 1+ 5 sec? udu 
0 


tan7! 4 
_ 3 3 
== tan” usec” u du 
8 Jo 


x tan“! 4 
= =f tan? u sec? u + (sec u tan u du) 
0 
tan“! 4 
ld 2 2 
Ji7/ l4 qe (sec u — 1)sec* u sec u tan u du. 
0 


Now, let w = secu so dw = secu tan u du. Hence, when u = 0, w = 1 and when 
u = tan™! 4, w = y 17. (See Figure 7.17.) Thus, the u-integral becomes 


x evi xn pv 
1 =f (w? — l)w* dw = =| (wt — wdw 
8 1 8 1 
Figure 7.17 Ifu = tan"! 4, 
then secu = /17. x/l 5 1 3 “a 
= — |-w°— -w 
8 \5 3 1 


(E) 


(er) 
— | i. 
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Alternatively, we could calculate the integral R r>./4r2 + 1dr using integra- 
tion by parts with u =r? (so du = 2r dr) and dv = rV/4r? + ldr (so v= 
Br? + 19⁄2). T 


Vector Surface Integrals 


Now we develop a means to integrate vector fields along surfaces, beginning with 
a definition. 


DEFINITION 2.2 LetX: D —> R? be a smooth parametrized surface, where 
D is a bounded region in the plane, and let F(x, y, z) be a continuous vector 
field whose domain includes S$ = X(D). Then the vector surface integral 
of F along X, denoted ffy F- dS, is 


[[#-4s= ff ro, t))+N(s, t) ds dt, 


where NA(S D — lin <ul 


As with line integrals, you are cautioned to be careful about notation for surface 
integrals. In the vector surface integral ffy F - dS, the differential term should be 
considered to be a vector quantity, whereas in the scalar surface integral ff. xi aS, 
the differential term is a scalar quantity (namely, the differential of surface area). 


EXAMPLE 4 LetF = xi+ yj+(z—2y)k. We evaluate ffy F + dS, where X 
is the helicoid 


X(s,t) = (s cost, s sint,t) O<s<1l, O<t<2z. 


The helicoid is shown in Figure 7.18. 
We have 


Ay), 92, Gy), 
a(s,t)  a(s,t)°  a(s,t) 


N(s, t) = 


sin t scost 
0 1 


cost —s sint 
sinź scost 


cost —s sint 
0 1 


= sinti — cost j+ sk. 


Using Definition 2.2, we obtain 


[[F-4 = [7 [ rasne, t)ds dt 


20 1 
Figure 7.18 The = i / (scosti+ ssintj 
helicoid of o oat 
Example 4. + (t — 2s sin t) k). (sint i — cost j + s k)ds dt 


20 1 2x 1 
2 2 _ de. BB 
=] Í (st — 2s sinn)ds dt = f (557t — $9 sint)| odt 


20 
= (it — $ sint) dt = (41? + 2 cost) |7 = x’. + 
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EXAMPLE 5 Let f(x, y) beascalar-valued function of class C! on a bounded 
domain D C R?. Suppose S is the surface described as the graph of z = f(x, y); 
that is, S = X(D), where X(x, y) = (x, y, f(x, y)). Then 


N(x, y) = —fri- fyj +k, 


so that Definition 2.2 becomes 


[|a= [res fom Chi- fit) dedy © 


Formula (5) will prove to be quite useful. + 


Further Interpretations 


As is the case for vector and scalar line integrals, there is a connection between 
vector and scalar surface integrals. Suppose X: D —> R? is a smooth parametrized 
surface and F is continuous on S = X(D). Let N(s, t) = T, x T, be the usual 
normal vector and let 


N(s, t) 
IN(s, ÐI 
That is, n is the unit vector pointing in the same direction as N. In particular, 
N(s, t) = IIN(s, £)I| n(s, £). 


Using Definition 2.2, we have that the vector surface integral is 


JAE = [ [ Fexo.mno.nasa 


a / [ F(X(s, ©) + (IIN(s, t)|Im(s, ©) ds dt 


n(s, t) = 


>f F(X(s, t) -n(s, DINGS, £| ds dt 
D 


= | [e-mas, (6) 


Since n is a unit vector, the quantity F- n is precisely the component of F in the 
direction ofn. In other words, formula (6) says that the vector surface integral of F 
along X is the scalar surface integral of the component of F normal to S = X(D). 
It is the surface integral analogue of formula (3) of 86.1, which states that the 
vector line integral of F along a path x is the scalar line integral of the component 
of F tangent to the image curve. To summarize, we have the following results: 


Line integrals: 


[¥-as= [ena 


Surface integrals: 


[[ #-4s= [[@-mas. 
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F(X(uo, Vp) )At 


Figure 7.19 The amount of fluid transported 
across a small piece of S during a brief time 
interval At may be approximated by the volume 
of a parallelepiped. 


As noted in §6.1, when x is a closed path, the quantity xe - T) ds in equation 
(7) is called the circulation of F along x. It measures the tangential flow of F 
along the path. On the other hand, the quantity f Ix (F -n)dS in equation (8) is 
known as the flux of F across S$ = X(D). If we think of F as the velocity vector 
field of a three-dimensional fluid, then the flux may be thought of as representing 
the rate of fluid transported across S$ per unit time, as we now see. (You may 
wish to compare the following discussion with the one in §6.2 concerning the 
two-dimensional flux across a curve.) 

To avoid notational confusion, we use u and v to denote the parameter vari- 
ables for X and ¢ as the variable representing time. Consider a small piece of S, 
having area AS, and the amount of fluid transported across it during a brief time 
interval At. This amount is the volume determined by F during Af. Figure 7.19 
suggests that if both AS and At are sufficiently small, then this volume can be 
approximated by the volume of an appropriate parallelepiped. Therefore, 


Amount of fluid transported ~ volume of parallelepiped 
= (height) (area of base) 
= F(X(uo, vo))At + n(uo, vo)AS, (9) 


since the height of the parallelepiped is the normal component of FAt. We obtain 
the average rate of transport across the surface piece during the time interval At 
by dividing (9) by At: 


Average rate of transport ~ F(X(uo, vo)) + n(uo, vo)AS. (10) 


Now, break up the entire surface S = X(D) into many such small pieces and sum 
the corresponding contributions to the rate of transport in the form given in (10). 
If we let all the pieces shrink, then, in the limit as all AS — 0, we have that the 
total average rate AM /At of fluid transported during At is approximately 


AM 
aa ~ ff Emas. 
At s 


Finally, let At — 0 and define the (instantaneous) rate of fluid transport to be 


T = | L@-mas. 
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Reparametrization of Surfaces 


As seen in 86.1, scalar and vector line integrals over curves depend on the geom- 
etry of the curve (and possibly its direction), rather than on the particular way in 
which the curve may be parametrized. Much the same is true for surface integrals. 
We begin with a definition, analogous to Definition 1.3 of Chapter 6. 


DEFINITION 2.3 Let X: Dı C R? —> R? and Y: Dy C R? — R? be para- 
metrized surfaces. We say that Y is a reparametrization of X if there 
is a one-one and onto function H: D) > D, with inverse H~!: D; > D> 
such that Y(s, t) = X(H(s, t)), that is, such that Y = X o H. If X and Y are 
smooth and H and H~! are both of class C!, then we say that Y is a smooth 
reparametrization of X. 


EXAMPLE 6 The helicoid parametrized by 


x =scost 
y=ssnt O<s<1, O0<t<2z 
z=t 


may also be described as 


sS 
x = — cos 2t 
2 


y = 5 sin2r O<s<2, 0<t<z. 


Z=2t 


The first description corresponds to a map X: [0, 1] x [0, 277] —> R? and the sec- 
ond to a map Y: [0, 2] x [0, m] —> R. It is not difficult to see that if we make the 
change of variables by letting 


s 
=— and = 2i; 
u 5 v 


then 
Y(s, t) = X(u, v). 
Equivalently, we can define a function H: [0, 2] x [0, z] — [0, 1] x [0, 27 ] with 


H(s, t) = (s /2, 2t). Then H is one-one and onto and Y = X o H. Therefore, Y is 
a reparametrization of X. + 


EXAMPLE 7 Suppose X is a smooth parametrized surface. Let Y(s, t) = 
X(u, v), where u = t, v = s. That is, Y = X o H, where H(s, t) = (t, s). Then 
Y is a (smooth) reparametrization that appears to accomplish little. However, if 
we let Ny denote the usual normal vector T, x T, = dY/ds x 0Y/dt, then we 
have 
ðY Ox ðY dx 
p = an — =; 
as dv ot Ou 
so that 
oY ƏY oOX Ox oX dx 
x = xX = x 


Ny = 
Ry ot ðv ðu Ou dv 


= Nx. 
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The parametrized surface Y is the same as X, except that the standard nor- 
mal vector arising from Y points in the opposite direction to the one arising 
from X. > 


The calculation in Example 7 generalizes thus: Suppose X is a smooth 
parametrized surface and Y is a smooth reparametrization of X via H, mean- 
ing that 


Y(s, t) = X(u, v) = X(H(s, t)). 


Since H is assumed to be of class C!, we can show from the chain rule that the 
standard normal vectors are related by the equation 
N a) ll 

y(s, t) = TE x(u, v). (11) 
(See the addendum at the end of this section for a derivation of formula (11).) 
Formula (11) shows that Ny is a scalar multiple of Nx. In addition, since H is 
invertible and both H and H~! are of class C!, it follows that the Jacobian of H 
is either always positive or always negative. (To see this, note that both H o H~! 
and H~! o H are the identity function. Hence, the chain rule may be applied to 
show that the derivative matrix DH(s, t) is invertible for each (s, t); therefore, its 
determinant, which is the Jacobian of H, must be nonzero. Since the determinant 
is a continuous function of the entries of H, it thus cannot change sign.) Hence, 
the standard normal Ny either always points in the same direction as Nx or else 
always points in the opposite direction (Figure 7.20). Under these assumptions, we 
say that both H and Y are orientation-preserving if the Jacobian (u, v)/0(s, t) 
is positive, orientation-reversing if (u, v)/0(s, t) is negative. 


ee 


t D, 
F N, S=X(D)=YD)) 
ee 
| AY 


Figure 7.20 If Y is an orientation-reversing reparametrization of X, then Ny 
points opposite to Nx. 


The following result, a close analogue of Theorem 1.4, Chapter 6, shows that 
smooth reparametrization has no effect on the value of a scalar line integral. 


THEOREM 2.4 Let X: Dı —> R? be a smooth parametrized surface and f any 
continuous function whose domain includes X(D)). If Y: D2 —> R? is any smooth 
reparametrization of X, then 


fpes- [fe 
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The proof of Theorem 2.4 appears in the addendum to this section. With this 
result, we can define the scalar surface integral over a smooth surface S by taking 
a smooth parametrization X: D > R? with S = X(D) that is one-one, except 
possibly on 0D. Then we define the scalar surface integral of f on S by 


fhs- [fv 


It is a fact (which we shall not prove) that any two smooth parametrizations of 
S must be reparametrizations of each other, so Theorem 2.4 tells us that any 
particular choice of parametrization we might make does not matter. We need to 
assume that X is (nearly) one-one to ensure that the integral is taken only once 
over the underlying surface $ = X(D). It is also a straightforward matter to extend 
these comments to give a definition of a scalar surface integral of a function over 
a piecewise smooth surface. 

Analogous to Theorem 1.5 of Chapter 6, the following result (whose proof 
is in the addendum) tells us that smooth reparametrizations only affect vector 
surface integrals by a possible sign change. 


THEOREM 2.5 Let X: Dı —> R? be a smooth parametrized surface and F any 
continuous vector field whose domain includes X(D,). If Y: D > R? is any 
smooth reparametrization of X, then either 


Jf F-as= ff F-as, 


if Y is orientation-preserving, or 


[[ ¥-as=- ff Fas, 


if Y is orientation-reversing. 


Because of Theorem 2.5, it is a more subtle and involved matter to define 
a vector surface integral over a smooth surface than to define a scalar surface 
integral. Given a smooth, connected surface, we need to choose an orientation 
for it. This is akin to orienting a curve but, perhaps surprisingly, is not always 
possible, even for a well-behaved, smooth parametrized surface, as Example 8 
illustrates. 

Here is a formal definition of orientability of a smooth surface. 


DEFINITION 2.6 A smooth, connected surface S is orientable (or two- 
sided) if it is possible to define a single unit normal vector at each point of 
S so that the collection of these normal vectors varies continuously over S. 
(In particular, this means that nearby unit normal vectors must point to the 
same side of S.) Otherwise, S is called nonorientable (or one-sided). 


It is a fact (clearly suggested by Figure 7.21) that a smooth, connected, ori- 
entable surface S has exactly two orientations. 

If S happens to be the image X(D) ofa smooth parametrized surface X: D > 
R?, then the normal vectors 


N:=T, xT, and N: =T; xT, = —N; 
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Figure 7.21 The connected orientable surface S shown 
with its two possible orientations. 


can be used to give unit vector normal vectors nj = N;/||Nj|| and n2 = N2/||No|| 
that point in opposite directions. It is tempting to think that n; and n always 
provide two orientations for S. However, even though both n; and n) may vary 
continuously with respect to the parameters s and f, it is not clear that they must 
vary continuously and consistently with respect to the points on the underlying 
surface S. Example 8 is a famous instance of a nonorientable surface. 


EXAMPLE 8 The surface parametrized by 


S 
= | 1 +t cos =) cos 
XxX ( 7 AY 


S : 
y = (141008 =) sins jara n, = latal 
2 2 2 
t si i 
zZ=tsin= 
2 
Z is called a Möbius strip. It may be visualized as follows: The t-coordinate curve 
at s = So 1s 
S0 
x= (cos SQ COS >) t + COS So 
n SO A 1 1 
y= Sin so COS 5 t + sin So = 5 St S55, 


: z) 
— Aan t 
Z= (sin 


This is a line segment through the point (cos sọ, sin sọ, 0) and parallel to the vector 


So\. . So\. . (50 
Figure 7.22 Some t-coordinate eee (>) ee N PAR (3) i 
curves of the parametrized Möbius : : i . ; F 
strip. Several such coordinate curves, marked with the direction of increasing t, are 


shown in Figure 7.22. We see that the Mobius strip is generated by a moving line 
segment that begins (at s = 0) lying along the positive x-axis, rises to a vertical 
position with center at (—1, 0, 0) whens = zr, and then falls back to horizontal, but 
with direction reversed ats = 27 . The s-coordinate curve att = 0 is parametrized 
by 


x = coss 
y = sins O0<s <2n, 
z=0 


x 


and so is a circle in the xy-plane. The full Möbius strip is shown in Figure 7.23. 
Figure 7.23 The Möbius strip of You can make a physical model by taking a strip of paper, giving it a half-twist, 
Example 8. and joining the short ends. 


Figure 7.25 Traveling once 
around the circular path on the 
Möbius strip forces the normal 
vector to reverse direction. 
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You can understand the gluing process analytically by noting that the map 
X:[0, 27] x [-5. 4] > R? 


defining the Mobius strip as a parametrized surface has the property that X(0, t) = 
X(2z, —t) but is otherwise one-one. Therefore, every point (0, t) on the left edge 
of the domain rectangle [0, 2x | x [—-4, 5] is mapped to the point (1 + t, 0, 0) of 
the Mobius strip, as is the point (27, —r) on the right edge of the rectangle. (See 
Figure 7.24.) 


t 


1/2 
a 
-1/2 


20 


Figure 7.24 Gluing the ends of a strip of paper so that the arrows align 
provides a model of the Mobius strip. 


Now, let’s investigate the orientability of the Mobius strip. The standard nor- 
mal vector is 


Oz), 9), Wy), 
ir we aE A 


N(s, t) = T, x T, = 


= E (coss +2t (cos*= — cos =))i 
2 2 2 


1 
5 (4008 5 4cos* 5 + t (1 + coss — cos? ))i 


eos” (1 | t cos >) k 
ok Cy a 


We have 
t 
N(0, t) = zÍ — (1 +ż)k, 
and 


t 
N(2x, —t) = zl + (1+f)k = -—N(0,ż). 
Therefore, a uniquely determined normal vector has not been defined. More 
vividly, imagine traveling along the Möbius strip via the s-coordinate path at 
t = 0, that is, along the circular path 


x(s) = X(s, 0) = (cos s, sins, 0), O<s <2z. 


Follow the standard normal N. At s = 0, it is N(0, 0) = —k, but by the time 
we close the loop, it is N(2z,0) =k. This apparent reversal of the normal 
vector means that the strip is not orientable at all—it is one-sided. (See 
Figure 7.25.) + 


A smooth, orientable surface together with an explicit choice of orientation 
for it is called an oriented surface. If S is such a smooth oriented surface, then 
we define the vector surface integral of F along S by finding a suitable smooth 
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Figure 7.27 The sphere 
x? + y? +z? = a? oriented 
by outward-pointing unit 
normal vectors. 


parametrization X of S such that the unit normal vector N(s, t)/||N(s, ¢)|| arising 
from the parametrization agrees with the choice of orientation normal. We take 


the surface integral to be 
[[#-as= ff F-as. 
s x 


By Theorem 2.5, if Y is any orientation-preserving reparametrization of X, the 
value of ffy F -dS is the same as ffy F-dS, so this notion of a surface inte- 
gral over the underlying oriented surface S is well-defined. Even though we may 
perfectly well calculate ffy F -+ dS, where X is the parametrized Möbius strip of 
Example 8, it does not make sense to consider the surface integral over the under- 
lying Möbius strip, since there is no way to orient it. Similarly, the interpretation 
of the vector surface integral as the flux of F across the surface only makes sense 
once an orientation of the surface is chosen. Then the flux measures the flow rate, 
positive or negative, depending on the choice of orientation. (See Figure 7.26.) 


n 


dat 
sig 


S 


Figure 7.26 How flux depends on orientation. On the 
left, the surface S is oriented by unit normal vectors so 
that F - n is positive at every point. Hence, SIs F -dS = 
Jf F -nds is positive. On the right, S is given the 
opposite orientation so that the flux ff, F +dS < 0. 


Another reason for de-emphasizing the role of parametrization in surface 
integrals is that we can often exploit the geometry of the underlying surface and 
vector field when making calculations. If S is a smooth, orientable surface and na 
unit normal that gives an orientation of S (so, in particular, n is understood to vary 
with the points of S), then, for a continuous vector field F defined on S, we have 


[[8-4s= f| F-nas. 


If we can determine a continuously varying, unit normal vector at each point of 
S (for example, if S is the graph of a function f(x, y) of two variables or the 
graph of a level set f(x, y, z) = c of a function of three variables), then there is 
a good chance that the surface integral can be evaluated readily. 


EXAMPLE 9 Let F = xi+ yj + zk bea radial vector field, and suppose S is 
the sphere of radius a with equation x? + y? + z? = a?. Orient S by outward- 
pointing unit normal vectors as shown in Figure 7.27. We calculate the flux of 
F across S in two ways: (1) by means of a parametrization of S and (2) via 
geometric considerations, that is, without resorting to an explicit parametrization 
of the sphere. 

For approach (1), use the usual parametrization X of the sphere: 


x =acoss sint 
y =asins sint O0<s<2n, 0O<t<az. 
Z=acost 
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The standard normal vector for this parametrization is 
Ns, t) = —a’ sint (coss sinti+ sins sint j+ cost k). 


(This normal vector is calculated in Example 11 of §7.1.) If we normalize N, we 
find that 
Ga N(s, t) ( Wata ae tk) 
n(s, t) = ———— = —(coss sinti + sins sint j + cost k). 
IIN(s, AIl 


Thus, n is inward-pointing at every point on the sphere. Therefore, we must 
make a sign change when we evaluate the vector surface integral, if we use the 
parametrization just given. Hence, we have 


F-as=- [f r.as-- f 7 PXG, ©) N(s, t) ds di 
S X 0 0 


x 2x 
=- f (acoss sinti +asins sint j+ a cost k) 
o Jo 


. (—a° sint (coss sinti + sins sint j + cost k) ds dt 


m 27 
=a / i sin t (cos? s sin’ t + sin’ s sin? t + cos? t)ds dt 
0 Jo 


bd 2n m 
= a f / sintds dt = 2a’ f sint dt = 4ra’. 
0 JO 0 


Now, reconsider this calculation along the lines of approach (2). Since S is defined 
as a level set of the function f(x, y, z) = x? + y? +z’, normal vectors can be 
obtained from the gradient: 


V +y + 2’) = 2xi+ 2yj+2zk. 
If we normalize the gradient, then we have unit normal vectors. Thus, 
2xi+2yj+2zk 2xi+2yj+2zk xi+yj+zk 
"Veta tae Wty re a 


2 


because x? + y? +z? =a? at points on S. (Note that n is always outward- 


pointing.) Therefore, 


[ [#-as= ff F-nas 


TEE 
=] (xi yita: (SEE) as 
S 
2 


a 
fap 22 2 
-Js ff Sa 
S a sa 
=a]f dS = a - area of S = a(4ra’°) = 4r a°. é 
s 


All of the preceding remarks concerning scalar and vector surface integrals 
can be adapted to define integrals over piecewise smooth, connected surfaces. 
Simply add the contributions of the surface integrals over the various smooth 
pieces. The only issue is that of orientation, but assuming that each of the smooth 
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Figure 7.29 The 
piecewise smooth 
cylindrical surface S 
of Example 10 shown 
with orientation 
normals. 


pieces is orientable, then it is possible to provide an orientation to the surface 
as a whole. Here’s how: Suppose Sı and Sz are two smooth surface pieces that 
meet along a common edge curve C. Let nı and n, be the respective unit normal 
vectors that give the orientations of Sı and S2. Then n; and np each give rise 
to an orientation of C via a right-hand rule. (To see this, for j = 1, 2, point 
the thumb of your right hand along n;; the direction of your fingers will indicate 
the orientation of C.) If C receives opposite orientations from n; and no, then Sı 
and S> are oriented consistently; if C receives the same orientation, then Sı and 
S2 are oriented inconsistently. (See Figure 7.28.) 


S1 Sı 


Figure 7.28 The piecewise smooth surface S = S; U Sy 
oriented consistently on the left and inconsistently on the 
right. 


EXAMPLE 10 Weevaluate /'/,(x? i + y? j) + dS, where S is the closed cylinder 


bounded laterally by x? + y? = 4, and on bottom and top by the planes z = 0 and 
z = 5, oriented by outward normal vectors. 

Evidently, S is the union of three smooth oriented pieces: (1) the bottom 
surface S1, which is a portion of the plane z = 0, oriented by n; = —k; (2) the top 
surface Sz, which is a portion of the plane z = 5, oriented by nz = k; and (3) the 
lateral cylindrical surface S; given by the equation x? + y? = 4 and oriented by 
normalizing the gradient of x? + y along S3, namely, 


: 2xi+ 2yj xit+yj xit+tyj 
3 = => = š 
J4x2 +4y? yEy 2 


See Figure 7.29 for a depiction of S. 
Now we calculate 


J| ass f| it as+ ff ias 


+ | (x3i+ y?j)-dS 
S3 


=f &i+y j) was + f (i+ y j) kdS 
Sı S2 


-fje (222) a 
S3 


=0+0+ | tat + y*)dS. 
S3 
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To finish the evaluation, we may parametrize S3 by 


x =2coss 
y =2sins O<s<2n, O<t<5S. 
z=t 


Then 


Jhesa- ff iat + yds 


5 27 
a f 5(16 costs + 16 sinf s)2 ds dt 
0 Jo 
5 27 
= Í i 16(cos* s + sinf s) ds dt 
0 Jo 


5 2n 
= | | 16((cos? s} + (sin? s}) ds dt 
0 0 


-f [> (=) =) didi 
o Jo 2 2 


from the half-angle substitution. Thus, 


5 27 
ffeit as= f | 18(2 + 2cos? 2s) ds dt 
S 0 0 


5 20 
=| / (8 + 4(1 + cos 4s)) ds dt. 
0 J0 


By once again using the half-angle substitution, we get 
5 2 5 
[fo i+yj)-dS= | (12s + sin4s)|"" dt = / 24n dt = 120m. $ 
S 0 0 
Summary: Surface Integral Formulas 


Scalar surface integrals: 


For a surface S parametrized by X: D C R? > Rô, 
// (ISS I TISE n TOOD x Ty || ds dt. 
5 x D 
Surface area element is dS = ||T, x T,|| ds dt. 


For a surface S described as a graph of a function z = g(x, y), where g: D € 
R? > R, 


Jf tas= ff 1y 80 Dye +EP dxa. 


Surface area element is dS = y gx(x, y + gy(x, y? + 1 dx dy. 
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Vector surface integrals: 


For a surface S parametrized by X: D C R? > R?, 


[[e-as= | foe-mas= ff rs DNe dasar, 


where N = T, x T, and n = N/|INII. 


Vector surface integral element is dS = N(s, t)ds dt. 


For a surface S described as a graph of a function z = g(x, y), where g: D C 
R? > R, 


J| F-as= f [E-was 


£ J [ ce, ea, eye eae ees, isa anal 


Here n = (—g,i— gyj+k)/ E FEF 1. 


Vector surface integral element is S = (—g, i— gy j +k) dx dy. 


Addendum: Proofs of Theorems 2.4 and 2.5 
We begin by establishing formula (11) of this section. 


m Lemma Suppose X: Dı — R? isasmooth parametrized surface and Y: D2 —> 
R? is a smooth reparametrization of X via H: Dy —> Dj, where we denote H(s, t) 
by (u, v). Then the standard normal vectors Nx and Ny are related by the equation 


(u, v) 


We S 30 


Nx(u, v). 


PROOF First, we set some notation. Since Y is a reparametrization of X via H, 
we have, from Definition 2.3, that 


Y(s, t) = X(H(s, t)) = X(u, v). (12) 


Write (x(s, t), y(s, t), z(s, t)) to denote Y(s, t) and (x(u, v), y(u, v), z(u, v)) to 
denote X(u, v), even though this is a small abuse of notation. 
By formula (7) of §7.1, we have 


302). 2z), 
TOE 
If we apply the chain rule to equation (12), we obtain 


DY(s,t) = DX(u, v)DH(s, t). 


a(x, y) 
a(s,t) — 


Ny(s, t) = 


Writing out this matrix equation, we get 


Xe Xy My 
Us Ut 
Ys Yt = Yu Yv | | , 


Zs ait Zu Zv 
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where xs, xz, etc. denote partial derivatives of the component functions of Y and 
Xu, Xv, etc. are the partial derivatives of the component functions of X. It is a 
matter of performing the matrix multiplication to check that 


[ xs x, | = first row of DY(s, t) 
= first row of the product DX(u, v) DH(s, t) 
= (first row of DX(u, v))+ DH(s, t) 


= | xu ap ‘al 


Similar results hold for the second and third rows of DY(s, t). We may recombine 
these results about rows and establish the following matrix equations: 


Rae | | a. He Us Ut 
Ys Yt Yu Yv Vs v |’ 
Xs Xt | | Xu Xv Us Ut 
Zs Zt E Zu Zv Us Vr , 


Ys Vt |_| Yu XW Us Ur 
Zs Zr Zu Zv Us v f 
Taking determinants, we find that 
a(x. y) _ a(x, y) Au, v) 
a(s, t) a(u, v) 3(s, t)’ 
A(x, z) _ Ə(x, z) Ilu, v) 
A(s,t)  Ə(u, v) A(s,t)’ 


and 


and 


d(y,z) _ Ə, z) Au, v) 
a(s, t)  3(u, v) 3(s, t) ` 


Thus, returning to the original formula for Ny, we find that 


(y, z) Ou, v), Ə(x,z)Əu, v), | A(x, y) A(u, We 


Ny(s, t) = eo. alu, v) Os, iy! = A(u, v) Əs, t) 
_ d(u, v) 
= 96, 1) NM 
as desired. z 


Proof of Theorem 2.4 We use Definition 2.1 and the change of variables 
theorem for double integrals. Thus, by Definition 2.1 and the lemma just proved, 


ffres ~ JI, FOC, D) IINvG, Dll ds dt 


_ (u, v) 
= J [ FXG, D) E 


IINx(u(s, t), v(s, t))|| ds dt. 
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From the change of variables theorem, it follows that 


[fi ras= ff f(X(u, v)) INx(u, v)|| audv= | | fas, 


by Definition 2.1. a 


Proof of Theorem 2.5 This result can be established along the lines of the pre- 
vious proof. Beginning with Definition 2.2 and using the lemma just established, 
we have 


[[F-as= ff F(Y(s, t) - Ny(s, t) ds dt 


= / / PXG, D). S Nx(u(s, t), v(s, t)) ds dt. 
Dı s,t) 


Therefore, 


J [ F-dS=+ i | FRCL, DNx, 066.9) | 


aU asar 


a(s, t) 
where we take the “+” sign if Y is an orientation-preserving reparametrization 


of X (since the Jacobian 3(u, v)/3(s, t) is positive and hence equal to its absolute 
value) and the “—” sign if Y is orientation-reversing. By the change of variables 


theorem, this last expression is equal to 


«ff F(X(W,0))-Nulu, v)dudv =+ f f F-as, 


by Definition 2.2. 


7.2 Exercises 


- Let Xio t)= GS ti) O<s<1, O<t<2. 


Find 


[fe +y*4+27)dS. 
X 


. Let D = {(s, t) | s? +t? <1, s> 0, t > 0} and let 


X: D —> R? be defined by X(s, t) = (s + t, s — t, st). 
(a) Determine ffy f dS, where f(x, y, z) = 4. 


(b) Find the value of f fy F + dS, where F = xi + yj + 
zk. 


. Find the flux of F = xi + yj +zk across the sur- 


face S consisting of the triangular region of the plane 
2x — 2y +z =2 that is cut out by the coordinate 
planes. Use an upward-pointing normal to orient S. 


. This problem concerns the two surfaces given para- 


metrically as 


X(s, t) = (s cost, s sint, 357), 


0O<s<2,0 <t < 27, 


and 


Y(s, t) = (2s cost, 2s sint, 12s), 


O<s<10<t<4n. 


(a) Show that the images of X and Y are the same. 
(Hint: Give equations in x, y, and z for the sur- 
faces in R? parametrized by X and Y.) 

(b) Calculate [f,(yi- xj + 27k) -dS and [fj(yi- 
xj +z? k) + dS. Reconcile your answers. 


. Find ffy x”dS, where S is the surface of the cube 


[-2, 2] x [-2, 2] x [-2, 2]. 


. Find ff,(x? + y?) dS, where S is the lateral surface of 


the cylinder of radius a and height h whose axis is the 
z-axis. 


. Let S be a sphere of radius a. 


(a) Find [fœ + y? + 27) dS. 
(b) Use symmetry and part (a) to easily find f fọ y? dS. 


8. Let S denote the sphere x? + y? + z* =a’. 
(a) Use symmetry considerations to evaluate ff, x dS 
without resorting to parametrizing the sphere. 


(b) Let F=i+j+k. Use symmetry to determine 
JJ; F + dS without parametrizing the sphere. 


9. Let S denote the surface of the cylinder x? + y? = 4, 
—2 < z < 2, and consider the surface integral 


ffe =f y jds. 


(a) Use an appropriate parametrization of S to calcu- 
late the value of the integral. 

(b) Now use geometry and symmetry to evaluate the 
integral without resorting to a parametrization of 
the surface. 


In Exercises 10-18, let S denote the closed cylinder with bot- 
tom given by z = 0, top given by z = 4, and lateral surface 
given by the equation x? + y? =9. Orient S with outward 
normals. Determine the indicated scalar and vector surface 


integrals. 
10. Jf zas 11. [ [ves 
S 5 
13. Tf xd 
s 


12. Jf zas 
s 
14. ffeity-as 15. [ [eas 
s S 
17. ffCvi+ x-as 
š 


16. [[ vias 
5 

18. ffias 
S 


In Exercises 19-22, find the flux of the given vector field F 
across the upper hemisphere x? + y? + z2? = a?,z > 0. Orient 
the hemisphere with an upward-pointing normal. 


19. F= yj 20. F= yi-xj 


21. F=—-yi+xj—k 22. F = xi +xyj+xzk 

23. Let S be the parametrized helicoid X(s,t)= 
(s cost, s sint, t), withO < s < 2,0 < t < 2x. Deter- 
mine the flux of F = yi + x j + z° k across S. 


24. Let F = 2xi+2yj+z°k. Find ff, F -+dS, where S 
is the portion of the cone x? + y? = z? between the 


planes z = —2, and z = 1, oriented with outward- 
pointing normal. 
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25. Find the flux of F= y’zi— xyj+(x+y+z)k 
across the portion of the surface z = ye* lying over 
the unit square [0, 1] x [0, 1] in the xy-plane, oriented 
by upward normal. 


26. Let S denote the tetrahedron with vertices (0, 0, 0), 
(1, 0, 0), (0, 2, 0), (0, 0, 3) oriented by outward nor- 
mal, and let F = x? i+ 4z j + (y — x) k. Find the flux 
of F across S. 


27. Let S be the funnel-shaped surface defined by x? + 
y? =z forl <z < 9andx? + y? =1 for0<z <1. 


(a) Sketch S. 

(b) Determine outward-pointing unit normal vectors 
to S. 

(c) Evaluate ff, F-dS, where F = —yi+xj+zk 
and S is oriented by outward normals. 


28. The glass dome of a futuristic greenhouse is shaped 
like the surface z = 8 — 2x? — 2y?. The greenhouse 
has a flat dirt floor at z = 0. Suppose that the tempera- 
ture T, at points in and around the greenhouse, varies 
as 


T(x, y, z) =x? + y? +3- 27. 


Then the temperature gives rise to a heat flux density 
field H given by H = —kVT. (Here k is a positive con- 
stant that depends on the insulating properties of the 
particular medium.) Find the total heat flux outward 
across the dome and the surface of the ground if k = 1 
on the glass and k = 3 on the ground. 


29. The surface given by X(s, t) = (x(s, t), y(s, t), z(s, t)), 
where 


s s 
= (a + cos = sint — sin= sin 2r) cos s 
2 2 

So. ae : 
y= (a + cos 5 ee sin 2r) sins , 


ae So 2 
z= sis SUSED sin 2t 


a is a positive constant, and 0 < s < 27,0 < t < 27, 
is known as a Klein bottle. 
@ (a) Use a computer to plot this surface for a = 2. 


(b) Determine (and describe) the s-coordinate curve 
att = 0. 


(c) Calculate the standard normal vector N along the 
s-coordinate curve att = 0(i.e., find N(s, 0)). Note 
that X(0, 0) = X(2x, 0). By comparing N(0, 0) 
and N(2z, 0), comment regarding the orientabil- 
ity of the Klein bottle. (See Example 8.) 


490 


Chapter 7 | Surface Integrals and Vector Analysis 


7.3 Stokes’s and Gauss’s Theorems 


Here we contemplate two important results: Stokes’s theorem, which relates 
surface integrals to line integrals, and Gauss’s theorem, which relates surface 
integrals to triple integrals. Along with Green’s theorem, Stokes’s and Gauss’s 
theorems form the core of integral vector analysis and, as explained in the next 
section, can be used to establish further results in both mathematics and physics. 


Stokes’s Theorem 


Stokes’s theorem equates the surface integral of the curl of a C! vector field over a 
piecewise smooth, orientable surface with the line integral of the vector field along 
the boundary curve(s) of the surface. Since both vector line and surface integrals 
are examples of oriented integrals (i.e., they depend on the particular orientations 
chosen), we must comment on the way in which orientations need to be taken. 


DEFINITION 3.1 Let S be a bounded, piecewise smooth, oriented surface 
in R>. Let C’ be any simple, closed curve lying in S. Consider the unit normal 
vector n that indicates the orientation of S at any point inside C’. Use n to 
orient C’ by a right-hand rule, so that if the thumb of your right hand points 
along n, then the fingers curl in the direction of the orientation of C’. (Equiva- 
lently, if you look down the tip of n, the direction of C’ should be such that the 
portion of S bounded by C’ is on the left.) We say that C’ with the orientation 
just described is oriented consistently with S or that the orientation is the 
one induced from that of S. Now suppose the boundary 0S of S consists 
of finitely many piecewise C!, simple, closed curves. Then we say that 0S 
is oriented consistently (or that 0S has its orientation induced from that 
of S) if each of its simple, closed pieces is oriented consistently with S. 


Some examples of oriented surfaces with consistently oriented boundaries 
are shown in Figure 7.30. If the orientation of S is reversed, then the orientation of 
dS must also be reversed if it is to remain consistent with the new orientation of S. 

Now we state a rather general version of Stokes’s theorem, a proof of which 
is outlined in the addendum to this section. 


Figure 7.30 Examples of oriented surfaces 
and curves lying in them having consistent 
orientations. On the right, the boundary of S2 
consists of three simple, closed curves. 


Figure 7.31 The 
paraboloid 
z=9-x — y 
oriented with upward 
normal n. Note that 
the boundary circle 
C is oriented 
consistently with S. 
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THEOREM 3.2 (STOKES’S THEOREM) Let S be a bounded, piecewise smooth, 
oriented surface in R*. Suppose that 4S consists of finitely many piecewise C!, 
simple, closed curves each of which is oriented consistently with S. Let F be a 
vector field of class C! whose domain includes S. Then 


[[vxP-as=f Peas. 
s as 


Theorem 3.2 says that the total (net) “infinitesimal rotation,” or swirling, of 
a vector field F over a surface S is equal to the circulation of F along just the 
boundary of S. 


EXAMPLE 1 Let S be the paraboloid z = 9 — x? — y? defined over the disk 
in the xy-plane of radius 3 (i.e., S is defined for z > 0 only). Then 0S consists of 
the circle 


C={(x,y,z) |x? +y? =9, z = 0}. 


Orient S with the upward-pointing unit normal vector n. (See Figure 7.31.) We 
verify Stokes’s theorem for the vector field 


F = (2z — y)i + (x + z)j + Gx — 2y) k. 


We calculate 
i j k 
VxF=)] d/dx d/dy 0/0z 
22-y x+z 3x-2y 


= (—2 — Di+@-3)j+d — (—1))k = —3i — j + 2k. 
An upward-pointing normal vector N is given by 
N =2xi+2yj+k. 


(This vector may, of course, be normalized to give an “orientation normal” n.) 
Therefore, using formula (5) of §7.2 we have, where D = {(x, y) | x? + y? < 9}, 


J| 5xt-as= | [-3i-i+20-@xi+ 2) + Waray 
s D 


= f | 6x -2+ 2dxdy 


=| ~6xdxdy— | [ 2ydxdy+ f f 2dxay. 
D D D 


By the symmetry of D and the fact that —6x and 2y are odd functions, we have 
that the first two double integrals are zero. The last double integral gives twice 
the area of D. Thus, 


[fy x F-dS = 2-2(3”) = 18m. 
s 
On the other hand, we may parametrize the boundary of S as 


x = 3 cost 
y =3sint 0<t <2n. 
z=0 
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Figure 7.32 The surface 
S= {(x, y, z) | 

z= ene ty), z > Il/e} 
has boundary 

aS = {(x, y, z) | 

x? +y = l, z = 1/e}. 


(This parametrization yields the orientation desired for 0S.) Then 


20 
f F-ds = I FO) -x'(t)dt 
as 0 


27 


= (0 —3sint, 3cost + 0,9 cost — 6sint)-(—3sint, 3 cost, 0) dt 
0 


27 27 
= Osin? +900 r)dt = | 9dt = 187; 
0 0 


which checks. + 


EXAMPLE 2 Consider the surface S defined by the equation z = e~* +”) for 
z > l/e (i.e., S is the graph of f(x, y) = e7 +°) defined over D = {(x, y) | 
x? + y? < 1}). Let 


F = (© — 2y)it (xet + y)jj+ e” k. 


Then, no matter which way we orient S, we can see that ff, V x F » dS looks im- 
possible to calculate. Indeed, suppose we take the upward-pointing normal vector 


N= 2xe 4") i+ 2ye j +k. 
Then, because 
V xF = (eP — xet) i+ (eT — e*)j + 2k 
(you may wish to check this), using formula (5) of §7.2, we find that 


[| xr-as 
s 


= I] ane ee? — xet?) + 2ye H (et =e TI g 2| dx dy. 
D 


We will not attempt to proceed any further with this calculation. 
It is tempting to use Stokes’s theorem at this point, since the boundary of S is 
the circle x? + y? = 1, z = 1/e. (See Figure 7.32.) If we parametrize this circle by 


x = cost 
ana 0<t<2r, 
L = = 

e 


then 


$ F-ds 
as 


20 
=f (ette L 2 sint, cost e™®!tl/e + sinz, eo") .(— sint, cost, 0) dt 
0 


2n 
= (2 sin’ t — sint e 
0 


sint+1/e sint+1/e 


+ cos” te + cost sint)dt. 
Again, we have difficulties. 

However, the power of Stokes’s theorem is that if S’ is any orientable 
piecewise smooth surface whose boundary 3S’ is the same as 0S then, subject to 


---~N 


i 


a aS = 0S’ 


Figure 7.33 Both S and S’ 
have the same boundary and 
are oriented as indicated. 
Therefore, by Stokes’s theorem, 
Sf V XF-dS= 


JSfy V XF. dS. 
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orienting S’ appropriately, 


[[vxr-as=¢ Feds= § Feds= |) VxF-ds. 
S as as’ ” 


Hence, we may evaluate ff 5 V x F -dS by using a different surface! (See Fig- 
ure 7.33.) 

To use this fact to our advantage, note that V x F has a particularly simple 
k-component. Thus, we let S’ be the unit disk at z = 1/e: 


S = {Œœ y, z) |x? +y? <1,z= le}. 


Consequently, if we orient S’ by the unit normal vector n = +k, we have 


[[vxnas= ff vxras 
= | [ovxe-mas 


= f | 24s =2-area ofS = 27. + 


Gauss’s Theorem 


Also known as the divergence theorem, Gauss’s theorem relates the vector 
surface integral over a closed surface to a triple integral over the solid region 
enclosed by the surface. Like Stokes’s theorem, Gauss’s theorem can assist with 
computational issues, although the significance of the result extends well beyond 
matters of calculation. 


THEOREM 3.3 (GAUSS’S THEOREM) Let D be a bounded solid region in R? 
whose boundary 3 D consists of finitely many piecewise smooth, closed orientable 
surfaces, each of which is oriented by unit normals that point away from D. (See 
Figure 7.34.) Let F be a vector field of class C! whose domain includes D. Then 


fp r-as= ff vrav. 


Figure 7.34 A solid region D whose 
boundary surfaces are oriented so that 
Gauss’s theorem applies. 


By a closed surface, we mean one without any boundary curves, like a sphere 
or a cube. The symbol ff is used to indicate a surface integral taken over a closed 
surface or surfaces. 
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Ny 
Ls 


Figure 7.35 The 
solid cylinder D of 
Example 3. 


Gauss’s theorem says that the “total divergence” of a vector field in a bounded 
region in space is equal to the flux of the vector field away from the region (i.e., 
the flux across the boundary surface(s)). 


EXAMPLE 3 Let F be the radial vector field x i + yj + zk and let D be the 
solid cylinder of radius a and height b, located so that axis of the cylinder is the 
z-axis and the top and bottom of the cylinder are at z = b and z = 0. (See Fig- 
ure 7.35.) We verify Gauss’s theorem for this vector field and solid region. 

The boundary of D consists of three smooth pieces: (1) the bottom surface S 
that is a portion of the plane z = 0 and oriented by the normal n; = —k, (2) the 
top surface S> that is a portion of the plane z = b and is oriented by the normal 
vector nz = k, and (3) a portion of the lateral cylinder $3 given by the equation 
x? + y? = a’ and oriented by the unit vector n3 = (x i + y j)/a. (The vector n3 
may be obtained by normalizing the gradient of f(x, y, z) = x? + y? that defines 
S3 as a level set.) Then 


fb F-ds= |f F-ds+ ff F-ds+ ff F-dS 
aD Sy S2 S3 


-| (i+ yj+ck)-(—kds4 I (xi+yj+zk)-kds 
Sı S2 
+f wit yitcw- (Fi) as 
S3 a 
2 2 
= ff -cas+ ff cass [| => dS 
Si S> S3 a 
az 
=0+ ff bas+ f fas, 
So S3 a 


since along Sj, z is 0; along Sy, z is equal to b; and along S3, x? + y? = a°. Thus, 


i) F -dS = b-area of S2 + a » area of S3 = bra? + a(2mrab) = 32a7b, 
aD 


from familiar geometric formulas. 
On the other hand, 


a ð 0 
V-F= = 3; 
gO OG O 
so that 


J vray = fff 3dV =3-volume of D = 3za’b, 
D D 


which can be checked readily. + 


In general, if F = xi + yj + z kand D is a region to which Gauss’s theorem 
applies, then 


f r-as= | | | v-rav= [ff 3dV =3-volume of D. 
aD D D 


Hence, 


ff (xi+ yj+zk)-dS = volume of D. 
aD 


Figure 7.36 The union 
of the surfaces S and S’ 
enclose a solid region D 
to which we may apply 
Gauss’s theorem. 
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Therefore, we may use surface integrals to calculate volumes in much the same 
way that we used Green’s theorem to calculate areas of plane regions by means 
of suitable line integrals. (See §6.2, especially Examples 2 and 3.) 


EXAMPLE 4 Let 
F =e coszit+ vx? + lsinzj+(x? + y? + 3)k, 
and let S be the graph of 
z=(1- a yhel for z > 0, 


oriented by the upward-pointing unit normal vector. It is not difficult to see that 
JT. s F -dS is impossible to evaluate directly. However, we will see how Gauss’s 
theorem provides us with elegant indirect means. 

Consider the piecewise smooth, closed surface created by taking the union of 
S and S’, where S’ is the portion of the plane z = 0 enclosed by 0S (i.e., the disk 
x? + y? < 1,z = 0). Orient S’ by the downward-pointing unit normal z = —kas 
shown in Figure 7.36. Note that S U S’ forms the boundary of a solid region D 
and, furthermore, that the orientations chosen enable us to apply Gauss’s theorem. 
Doing so, we have 


[[v-as+ ff vas fp r-as= fff vrav. 


Now, it is a simple matter to check that V - F = 0 for all (x, y, z). Therefore, the 
triple integral is zero, and we find that 


[[v-as+ ff F-as=0. 
[{vas=- ff vas 


In other words, because F is divergenceless, Gauss’s theorem allows us to replace 
the original surface integral by one that is considerably easier to evaluate. Indeed, 
we have 


[ [v-as=- f| -cwas= f fo? +9*+3aray, 


where R is the unit disk {(x, y) | x? + y? < 1} in the plane. Now, we switch to 
polar coordinates to find 


20 1 
[ [e+y+aaxay= f j! (r? +3)r dr dO 
R 0 0 


2x 
1 
=f GARA 


so that 


EXAMPLE 5 Consider the vector field 
_ xityj+zk 
~ 2+ y2 + 2293/2" 
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(This is an example of an inverse square vector field.) The flux across the sphere 
x? + y? +z? = a’ oriented by outward unit normal n is given by ff; F-dS = 
ff s F-ndS. The unit normal to S may be computed as 


na V(x? +y? +2") _ 2xi+2yj+2zk  xi+yj+zk 
IVE +y +2) (4x2 + 4y? $422 a ' 
since x? + y? + z? = a? on the surface of the sphere. In a similar way, we may 


write F(x, y, z) as (x i + y j + z k)/a? whenever (x, y, z) is a point on the sphere. 
Hence, 


. Stok . . 
fras- Prenas =f (HHI). (Att ek) gs 
S S S a a 
2 2 2 
-fC Eas 
S a 


a? 1 
i) (=) dS = — (surface area of S) 
s Ka a 


4 (47a) = 4r. 


II 


II 


Now we note that the partial derivative of the first component of F with 
respect to x is 


OF, tyt z2 — 3x2(x?2 + y2 + 2)? 2x2 4 y2 4 2? 
ae Q2 Fy HZ2 = 2+ y? +z 
Similarly, 
OF a2? OF x? +y = 22? 
dy ~ (x? + y? + 27)9/2 ane az (x2 + y? + 72)9/2° 


Thus, V - F = 0, and so any triple integral of V - F must be zero. This would seem 
to be at odds with Gauss’s theorem. There is no contradiction, however. Note that 
the vector field F is not defined at the origin. Therefore, the hypothesis that F 
be defined throughout the solid region enclosed by S is not satisfied and, hence, 
Gauss’s theorem does not apply in this situation. + 


The Meaning of Divergence and Curl 
Part of the significance of Stokes’s theorem and Gauss’s theorem is that they 
provide a way to understand the meaning of the divergence and curl of a vector 
field apart from the coordinate-based definitions of §3.4. By way of explanation, 
we offer the following two results: 


PROPOSITION 3.4 Let F be a vector field of class C! in some neighborhood of 
the point P in RÌ. Let Sa denote the sphere of radius a centered at P, oriented 
with outward normal. (See Figure 7.37.) Then 

div F(P) = lim 


3 
fb F- ds. 
a>0+ 4ra? S 


Figure 7.37 A 

sphere of radius a PROOF We have, by Gauss’s theorem, that 

used to understand 3 3 

the divergence of a lim fp F-dS = lim JT] divF dV, (1) 
vector field. a>0+ 4ra? Sa a>0* 4ra? Da 


PAC, = aS, 


S 


a 


Figure 7.38 A circle of 
radius a centered at P. 
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where Da is the solid ball of radius a enclosed by Sa. Next, we use a result 
known as the mean value theorem for triple integrals, which states that if f 
is a continuous function of three variables and D is a bounded, connected, solid 
region in space, then there is some point Q € D such that 


Jf to»2aVv= f(Q). volume of D. 


In our present situation, this result implies that there must be some point Q € Da 
such that 


Tf] div F dV = div F(Q)- (volume of D4) = (47ra?) div F(Q). (2) 
Da 


Applying formula (2) to formula (1), we have 


ım 
a—>0+t 4ra? 


i) F-dS = lim div F(Q) = div F(P), 
Sa a(t 


since, as a — 07, the ball D, becomes smaller, “crushing” Q onto P. E 


PROPOSITION 3.5 Let F be a vector field of class C! in a neighborhood of the 
point P in R°. Let C, be the circle of radius a centered at P situated in the plane 
containing P that is perpendicular to the unit vector n. (See Figure 7.38.) Then 
the component of curl F(P) in the n-direction is 
n- curl F(P) = Im 5 f F -ds, 
Wa" Jc, 


where C, is oriented by a right-hand rule with respect to n. 


PROOF Let Sa denote the disk of radius a in the plane of C, enclosed by Ca. By 
Stokes’s theorem, 


lim ET) F-ds= lim ae VxF-dS 
a—>0t ma? on a—>0*t zma? 
= im ga f |O ~P was. a 


There is a mean value theorem for surface integrals (similar to the mean value 
theorem for triple integrals used in the proof of Proposition 3.4) enabling us to 
conclude that there must be some point Q in Sa for which 


/ (V x F-n)dS = (V x F(Q)-n)(area of S,) 
= ma°(V x F(Q)- n), (4) 
since S, is a disk of radius a. Therefore, using equations (3) and (4), we find that 


1 e od 2 
lim gg $, Pis im lav x F(O)-m 


= lim V x F(Q)-n 
= V x F(P)-n, 


since Q > P asa —> 0*. E 
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Figure 7.39 The 
configuration needed 
to calculate the 
k-component of 

curl F(xo), using 
Proposition 3.5. 


Propositions 3.4 and 3.5 justify our claims, made in §3.4, about the intuitive 
meanings of the divergence and curl of a vector field. The quantity Bs, F-dS 
used in Proposition 3.4 is the flux of F across the sphere S,, and so 


lim : fp F-dS 

a—>0t 4ra? S 

is precisely the limit of the flux per unit volume, or the flux density of F at P. 
Similarly, 


1 
lim —¢ F-ds 
a—>0r ma? GA 


is the limit of the circulation of F along C, per unit area, or the circulation 
density of F at P around n. In particular, Proposition 3.5 shows that curl F(P) 
is the vector whose direction maximizes the circulation density of F at P and 
whose magnitude is equal to the circulation density around that direction (or else 
curl F(P) is 0 if the circulation density is zero). 

In fact, we can turn our approach to divergence and curl completely around 
and, instead of defining the divergence and curl by means of coordinates and the 
del operator and proving Propositions 3.4 and 3.5, use the surface and line integral 
formulas of Propositions 3.4 and 3.5 to define divergence and curl and derive the 
coordinate formulations from the limiting integral formulas. 

Write F as 


M(x, y, z)i + N(x, y,z)j + P(x, y,z)k, 


where M, N, and P are functions of class C! in a neighborhood of the point 
Xo = (xo, Yo, Zo). We will demonstrate how to recover the coordinate formula 


ver (> ~)i | (= Pi | (= Mk 
dy Oz az Ox Ox dy 

from the formula in Proposition 3.5. (A similar argument can be made to derive 
the coordinate formula for the divergence, the details of which we leave to you.) 
The idea is to let the unit vector n equal, in turn, i, j, and k in the formula in 
Proposition 3.5 and thereby to determine the components of the curl. 

First, let n = k, so that C, is the circle of radius a in the horizontal plane 
Z = Zo, oriented counterclockwise around k. (See Figure 7.39.) Then Ca may be 
parametrized by 


x = xo + a cost 
y= yy +asint 0<t<2x. 
g= g0 


Therefore, 


2x 
f Feds= | F(x(t)) + x'(t) dt 
Ga 0 


20 
= | (M(x(t))i+ N(x(t))j + POK) + (—asintitacostj)dt 
0 


20 
=a (—sint M(x(t)) + cost N(x(t))) dt. 
0 
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Next, use Taylor’s first-order formula (see §4.1) on M and N, which yields, near 
Xo (i.e., for (x, y, z) © (xo, Yo, Zo)), 
M(x, y, z) © M(xo) + Mx(X0)(x — xo) + My(Xo)(y — yo) + Mz(Xo)(Z — zo); 
N(x, y, z) © No) + Nx(Xo)(x — x0) + Ny(Ko)(y — yo) + Nz(X0)(Z — zo). 
Along the small circle Ca, we have x — x9 = a cost, y — yo = a sint, and z — 


zo = 0, so that, using the approximations for M and N, we have 


27 
f F-ds~a | —sin t [M (x0) + M; (Xo)a cost + M, (Xo)a sint ] dt 
Ca 0 


27 
+a / cos [N (x0) + Nx(Xo)a cost + N,(xXo)a sint | dt 
0 
27 2n 
= -am(xo) | sint dt — amx) | sint cost dt 
0 0 


2n 20 
— a’M,(Xo) | sin? t dt + aN (xo) i cost dt 
0 0 


20 27 
+a? N, 0) / cos’ t dt + a? Ny (Xo) | sint costdt. (5) 
0 0 


This last equality holds because M (xo), M,(X0), etc., do not involve t and so may 
be pulled out of the appropriate integrals. You can check that 


2n 20 27 
/ sintdr = | costar = | sint costdt = 0, 
0 0 0 


2x 2n 
Í sin? t dt = Í cos? tdt =z. 
0 0 


Therefore, the approximation in (5) simplifies to 
§ F-ds~ —ma*M,(Xxo) + ma? N,(Xo). 
Ca 


Now, the error involved in the approximation for fe F -ds tends to zero as Ca 
becomes smaller and smaller. Thus, 


1 
k-curl F(xo) = lim ET) F-ds 
a—>0t mta Cy 


= lim (—My(xo) + Nx(Xo)) 


a>0t 
= Nx(x0) — My(Xo). 
If we let n = j, so that C4 is the circle parametrized by 


x = xo tasint 
y= Yo 0<t<2r, 
Zz = Zo +a cost 


then a very similar argument to the one just given shows that 


j- curl F(xo) = M: (x0) — Px (Xo). 


500 Chapter 7 | Surface Integrals and Vector Analysis 


Finally, let n = i, so that C, is parametrized by 


X = X0 
y = yy +acost O<t<2nz, 
z =Z +asint 


to find 
i- curl F(xo) = P, (x0) — N: (x0). 


We see that the i-, j-, and k-components of the curl of F are as stated in §3.4. 


Addendum: Proofs of Stokes’s and Gauss’s Theorems - 


Proof of Stokes’s theorem 
Step 1. We begin by establishing a very special case of the theorem, namely, 
the case where the vector field F = M(x, y, z)i(i.e., F has an i-component only) 
and where the surface S is the graph of z = f(x, y), where f is of class C! ona 
domain D in the plane that is a type 1 elementary region. (See Figure 7.40.) To 
be explicit, the region D is 


(Qa y) | YO) < y Soe), a <x <b}, 


where y and ô are continuous functions. We assume that S is oriented by the 
upward-pointing unit normal. 


OMe ec 


© h=) 


x D Top view 


Figure 7.40 The surface S is the graph of f(x, y) for (x, y) in 
the type 1 region D shown at the right. 


First, evaluate f s F- ds. The boundary 0S consists of (at most) four smooth 
pieces parametrized as follows. (The subscripted curves correspond to the encir- 
cled numbers in Figure 7.40.) 


x=t x=b 

Ci: yy=yvt) ast<b; C: yy=t y(b) < t < 6(b); 
z= f(t, y@)) z= f(b,t) 
=f x=a 

C3: {$ y= ô(t) a<t<b; Ch {yal y(a) =t < d(a). 
z= f(t, êt) z= f(a,t) 


The parametrizations shown for C3 and C4 induce the opposite orientations to 
those indicated in Figure 7.40. Therefore, 


f F-as= | F-as+ | F-as- | Feds f F- ds. 
as Ci & C3 Gi 
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Consider the integral over C1. Since F has only an i-component, 


b 
J F-ds= Mdx+0dy+0dz= M(t, y(t), f(t, y@))) dt. 
Ci Ci 


a 


The line integral f oF: ds may be calculated in a manner similar to that for 
f c, F+ ds. In particular, we obtain 


b 
l F-ds= M(t, d(t), f(t, d(t))) dt. 
C3 a 
For the integral over C2, note that x is held constant. Thus, 
/ F-ds= M dx =Q. 
C C 


Likewise, f c, F + ds = 0. The result is 


b b 
f F-ds= / M(t. y(t), FE, yO) dt — / M(t, 8(t), f(t, 5(t))) dt 
os a 


a 


b 
[M(x, v(x), f(x, ya) — MQ, d(x), F(x, (x) dx. 6) 


a 


(In this last equality we’ve made a change in the variable of integration.) 
Now we compare the line integral to the surface integral ffy V x F» dS. For 
F = M(x, y, z)i, we have V x F = M, j — M; k, so formula (5) of §7.2 yields 


[ [vrs | (M,j — M; k): (—fxi— fyj +k dx dy 
S D 

e 7°) (aM a M 

a Jy(x) dz dy dy 


dM o0Mod 
(MCs ys Fle = 


Thus, using the fundamental theorem of calculus, we have 


f| 5r a doas y, f(x, y))) dy dx 


- M(x, y, f(x, PIO dx 


The chain rule implies 


a 


b 
= | [—M(x, d(x), f(x, 6x) + MQ, v(x), fæ, Ya) ax, 
which agrees with equation (6). 

We may readily extend this result to surfaces that are graphs of z = f(x, y), 
where the point (x, y) varies through an arbitrary region D in the xy-plane via a 
two-stage process: First, establish the result for regions D that may be subdivided 
into finitely many elementary regions of type 1 and then apply a limiting argument 
similar to the one outlined in the proof of Green’s theorem in §6.2. 
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Figure 7.41 The surface S 
is a portion of the plane 
x=c.IfF= M(x, y,z)i, 
then F is always 
perpendicular to S, in 
particular, to any vector T 
tangent to ð S. 


Figure 7.42 The surfaces Sj 
and Sz share the curve C as 
part of their boundaries. C 
receives one orientation as 
part of 0S, and the opposite 
orientation as part of 0S. 
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Step 2. Still keeping F = Mi, note that the argument given in Step 1 works 
equally well for surfaces of the form y = f(x, z)—simply exchange the roles of 
y and z throughout Step 1. 

It is also not difficult to see that if S is a portion of the plane x = c (where 
c is a constant), then Stokes’s theorem for F = Mi holds for this case, too. On 
the one hand, n = +i for such a plane (depending on the orientation chosen) and 
V x F = M; j — M; k, as we have seen. Hence, 


[[vxr-as= f [ov xr-mas= ff oas=o. 


On the other hand, since F has only an i-component, F is always parallel to 
n and thus perpendicular to any tangent vector to S, including vectors tangent 
to any boundary curves of S. Therefore, f s F -ds = 0 also. (See Figure 7.41.) 
Now, suppose that S = Sı U S2, where Sı and Sz are each one of the graph of 
z= f(x, y), the graph of y = f(x, z), or a portion of the plane x = c. Assume 
that Sı and Sz coincide along part of their boundaries, as shown in Figure 7.42. 
The surfaces Sı and S2 inherit compatible orientations from S. If C denotes 
the common part of ð Sı and 3S2, then we may write 0S, as Cı UC and 0S) as 
C2 U C, where C; and C} are disjoint (except at their endpoints). If 0S; is oriented 
consistently with S; fori = 1, 2, then note that C will be oriented one way as part 
of ð Sı and the opposite way as part of 3S2. From this point, let us agree that C 
denotes the curve oriented so as to agree with the orientation of 0S}. 

Now Stokes’s theorem with F = Mi holds on both Sı and S2; on Sı we have 


Tf VxF-dS= $ F-as= | Feds+ | F-ds (7) 
Sı aS) Ĉi C 


whereas on > we have 


I VxF-ds= $ F-as= | Feds— | F-ds (8) 
Sy dS C2 Cc 


in view of the remarks made regarding the orientation of C. Now, we consider 
S = Sı U Sy, noting that C is not part of 0S. We see that 


[[vxras= vxF-ds+ f) VxF-dS. 
S Sı S2 


Using equations (7) and (8) and canceling, we find that 


[ Fast | Feds= ġ F -ds. 
Gi C2 as 


Thus, Stokes’s theorem holds in this case, or, indeed, in the case where S can be 
written as a finite union S1 U $2 U --- U S, of the special surfaces just described. 

From a practical point of view, any particular surfaces you are likely to 
encounter will be decomposable as finite unions of the special types of surfaces 
previously described. However, not all piecewise smooth surfaces are, in fact, of 
this form. So to finish a truly general proof of Stokes’s theorem when F = Mi, 
some further limit arguments are needed, which we omit. 


Step 3. Finally, by permuting variables, we essentially repeat Steps 1 and 2 in 
the cases where F = N(x, y, z)j or F = P(x, y, z)k. In general, by the additivity 


x R 


Figure 7.43 The type 1 region 
D and its shadow region R in the 
xy-plane. 
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of the curl, we have that 


[[vxras= [fv xmit i+ Pw-as 
= f| vxcrn-as+ [fv xwi-as 
+ [| vx(Pw-as. 


Using the versions of Stokes’s theorem just established, we see that 


[[vxP-as= Mi-ds+ Ni-as+ f Pk-ds, 
S ðS as as 


-¢$ (Mi+Nj+PK)-ds= dp F-ds, 
os as 


as desired. E 


Proof of Gauss’s theorem 


Step 1. We prove a very special case of Gauss’s theorem, namely, the case in 
which F = P k (where P(x, y, z) is of class C! on a domain that includes the 
solid region D) and where D is an elementary region of type 1, as in Figure 7.43. 
We denote the bottom surface boundary of D by Sı and take it to be given by 
the equation z = g(x, y), where ¢ is of class C! and, similarly, we let S2 denote 
the top surface boundary and assume it is given by the equation z = w(x, y), 
where w is also of class C!. The lateral surface boundary is denoted by 53; it may 
reduce to a curve or be empty but otherwise is a cylinder over the boundary curve 
of a region R in the plane forming the shadow of D. 

Orienting 0D = Sı U S2 U $3 with outward-pointing normal vectors, we have 


fb r-as= | | r-as+ ff r-as+ ff F- dS. 
aD Sı So S3 


The orientation normal to $; should be downward-pointing, hence parallel to 
(xi + pyj — k, the opposite of the normal vector obtained from the standard 
parametrization of S|. Therefore, using formula (5) of 87.2, we have 


I F-ds= |f E Cy E E E ree 
Sı R 


= -Jf P(x, y, p(x, y))dx dy. 


Similarly, the orientation normal to Sọ should be upward-pointing, and so 


J| Eas- [Pen ve. Cwi- ti + Waray 


= I P(x, y, W(x, y)) dx dy. 


Now the lateral surface $3, if it is nonempty, is a cylinder over a curve in the 
xy-plane. Hence, 53 is defined by one or more equations of the form g(x, y) = c. 
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Figure 7.44 The regions Dı and 
Dy share the surface S as part of 
their boundaries. This common 
surface S inherits one orientation 
as part of dD, and the opposite 
orientation as part of ð D2. 


It follows that any normal vector to $3 can have no k-component. Thus, 


II F-ds= | Pxmas= |f 0dS=0. 


Putting these results together, we find that 


f F-ds= | f [Pæ y, w(x, y)) — PC, y, p(x, y))] dx dy. 
aD R 


On the other hand, if F = Pk, then V-F = 0P/0z, so that, by the funda- 
mental theorem of calculus, 


Wy) 3P 
[[[v-rav=ff | — dzdx dy 
D RI g(x,y) 92 


z / Í PERV — EE E T 


Therefore, Gauss’s theorem holds in this special case. 


Step 2. We may repeat Step 1 for F = Mi and D an elementary region of 
type 2, and for F = Nj and D of type 3. If D is an elementary region of type 4 
(meaning D is simultaneously of types 1, 2, and 3), then 


ii F-as= ff (Mi+Nj+Pk)-dS 
aD aD 
= mi-as + fp vi-as+ fp Pk-dS 
aD aD aD 
= [f| v-miav+ ff vewiavs fff v- Prav 
D D D 
= [f| v-mmi+Ni+ Pwav 
D 
= [f| v-rav. 
D 


Step 3. Suppose that D = D, U Dy, where each of D; and D, is a type 4 
elementary region, and that Dı and D> coincide just along part of their boundaries 
as shown in Figure 7.44. Let S denote the common part of 0D; and 0 D2. Then 
we write 0D, as Sı U S and ð D3 as S2 U S, where Sı and Sz are disjoint (except 
perhaps along portions of their respective boundaries). If we orient ð Dı and 3 D2 
with outward normals so as to satisfy the hypotheses of Gauss’s theorem, then S 
will be oriented one way as part of 0D, and the opposite way as part of ð D2. Let 
us agree that the symbol S denotes the surface oriented so as to agree with the 
orientation of 0D). 

Applying Gauss’s theorem to both D; and D3, we obtain 


[[[ vrav =f was= ff v-as+ [[r-as 
J| av= F-as= ff ras- ff r-as 
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Combining these last two equations, we find that 


[ff vrava fff vrav fff vrav 


since OD = Sı U Sp. 


// F-ds+ |f F-dS 
Si S2 
f F- dS, 

aD 


Step 4. The result of Step 3 may be extended to regions D that can be 
decomposed as a union of finitely many type 4 regions. However, not all regions 
to which Gauss’s theorem applies meet this criterion. Consequently, to finish a 
truly general proof, we once again need an argument using suitable limits of 
regions and their integrals, which we omit. E 


7.3 Exercises 


In Exercises 1—4, verify Stokes s theorem for the given surface 
and vector field. 


1. S is defined by x? +y? +5z=1, z > 0, oriented by 
upward normal; 


F=xzi+ yzj + (+ y^k. 


2. S is parametrized by X(s, t) = (s cost, s sint, t), 0 < 
s <1,0<t<x/2; 


F=zi+xj+yk. 
3. S is defined by x = y 16 — y? — z?; 
F=xi+yj+zk. 


4. S is defined by x? + y? +z? = 4, z < 0, oriented by 
downward normal; 


F = (2y — z)i + (x + y? — z)j + (4y — 3x)K. 


5. Let S be the “silo surface,” that is, S is the union of 
two smooth surfaces Sı and S2, where S is defined by 


go. y? = 9. 

and S is defined by 
x+y +(z—8) =9, 

Find ffy V x F» dS, where 


0<z<8 
z> 8. 


F = (x? + xz + yz’) i + (xyz? + y)j4+272k. 
In Exercises 6-9, verify Gauss s theorem for the given three- 
dimensional region D and vector field F. 
6. F=xi+yj+zk, 
D = {œ y,z) 10 <z 59- x? — y?) 
7. F=(y—x)i+(y—z)j+(x— y)k, D is the unit 
cube [0, 1] x [0, 1] x [0, 1] 


10. 


11. 


12. 


13. 


. F=x°i+yj+zk, D = {(x, y,z) |x? +y +1 < 


z <5} 
F : k 

ao oe D = {(x, y, z) | a° < x? + 

Vx +y Haz? 
y+2 <b} 
Verify that Stokes’s theorem implies Green’s theo- 
rem. (Hint: In Stokes’s theorem take F(x, y, z) = 
M(x, y)i+ N(x, y)j; that is, assume F is independent 
of z and that its k-component is identically zero.) 


Let S be the surface defined by y = 10 — x? — z? with 
y > 1, oriented with rightward-pointing normal. Let 


F = (2xyz + 5z)i + e* cos yzj + x°y kK. 


[ [vxreas. 
s 


(Hint: You will need an indirect approach.) 


Determine 


Let S be the surface defined as z = 4 — 4x? — y? 
with z > 0 and oriented by a normal with nonnegative 
k-component. Let F(x, y, z) = xi + e” j+ ze®k. 
Find ff, V x F-dS. (Hint: Argue that you can inte- 
grate over a different surface.) 


(a) Show that the path x(t) = (cost, sinf, sin 2t) lies 
on the surface z = 2xy. 


(b) Evaluate 
fo + cosx)dx + (sin y + z?) dy + x dz, 
C 


where C is the closed curve parametrized and ori- 
ented by the path x in part (a). 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Let S consist of the four sides and the bottom face of 
the cube with vertices (Œa, +a, +a). Orient $ with 
outward-pointing normals. Evaluate ff, VxF- dS, 
where F = x? yz? i + x?yj + xe” sin yzk. 


Use Stokes’s theorem to find the work done by the vec- 
tor field F = (xyz — e*)i — xyz j + (x?yz + sin z) k 
on a particle that moves along the line segments from 
(0, 0, 0), then to (1, 1, 1), then to (0, 0, 2), then back to 
(0, 0, 0). 


Let C be a simple, closed curve that lies in the plane 
2x — 3y + 5z = 17. Show that the line integral 


$a cosx + z)dx + (5x — e”)dy — 3y dz 
c 


depends only on the area enclosed by C and its ori- 
entation, not on its particular shape or location in the 
plane. 


Use Gauss’s theorem to find the volume of the solid re- 
gion bounded by the paraboloids z = 9 — x? — y? and 
z = 3x? +3y? — 16. 


Let S be defined by z = e!=} =), z> 1, oriented 
by upward normal, and let F = xi + yj + (2 — 2z) k. 
Use Gauss’s theorem to calculate 


[ [e-as. 


Give a proof of Stokes’s theorem for smooth, 
parametrized surfaces S = X(D), where X: DC 
R? — R?. To make the proof easier, assume that X 
is of class C? and that it is one-one on D (in which 
case 0S = X(dD)). 


Use Gauss’s theorem to evaluate 


[[F-as. 


where F=ze* i+3yj+(2—yz’)k and S is the 
union of the five “upper” faces of the unit cube 
[0, 1] x [0, 1] x [0, 1]. That is, the z = 0 face is not 
part of S. (Hint: Note that S is not closed, so to apply 
Gauss’s theorem you will have to close it up.) 


In this problem, let f(x, y, z) be a scalar-valued func- 

tion of class C!, and let D be a region in space to 

which Gauss’s theorem applies. Let n = (n1, n2, n3) 

be the outward unit normal vector to S = aD. 

(a) Ifa is any constant vector and F = fa, show that 
V-F=Vf-ea. 

(b) Use (a) with that 


a 
fp fnidS = Tif af ay. Also obtain similar 
Ss D Ox 


results by letting a equal j and k. 


part a=i to show 


22. 


23. 


24. 


(c) Define a vector quantity f fdS= fb fndS by 
s s 


fp tnas = (ff fas, fads. ff fra). 


With notations and definitions as above, show that 


fris- [ff 


(Note that the right side is a triple integral of a 
vector-valued expression, so it is also computed 
by integrating each scalar component function.) 


Given a liquid with constant density ô, introduce co- 
ordinates so that the (flat) surface of the liquid is the 
xy-plane and the z-coordinate measures the depth of 
the liquid from the surface. (That is, the positive z-axis 
points down into the liquid.) Then the pressure p inside 
the fluid due to gravity is given by p(x, y, z) = 6gz, 
where g is acceleration due to gravity. Suppose that a 
solid object is immersed in the liquid. If the object fills 
out a region D in space, then the total buoyant force 
on the solid is the total liquid pressure on the boundary 
surface S = 0D and is given by 


B= - ff pnas. 
s 


where n is the outward unit normal to S. (The negative 
sign arises because the pressure causes a force point- 
ing inward on the object.) Use the previous exercise to 
demonstrate Archimedes’ principle: The magnitude of 
the total buoyant force on an object equals the weight 
of the liquid displaced. 


Write a careful proof of the three-dimensional case of 
Theorem 3.5 of Chapter 6: If F is a vector field of class 
C! whose domain is a simply-connected region R in 
R, then F = V f for some (scalar-valued) function f 
on R if and only if V x F = 0 at all points of R. 


Let S, denote the sphere of radius r with center at the 
origin, oriented with outward normal. Suppose F is of 
class C! on all of R? and is such that 


f F-dS=ar+b 


for some fixed constants a and b. 


(a) Compute 
J / l V- Fav, 
D 


where D = {(x, y, z) | 25 < x? + y? +z? < 49}. 
(Your answer should be in terms of a and b.) 

(b) Suppose, in the situation just described, that F = 
V x G for some vector field G of class C!. What 
conditions does this place on the constants a and b? 


25. 


26. 


27. 


28. 


Let n(x, y, z) beaunit normal to a surface S. The direc- 
tional derivative of a differentiable function f(x, y, z) 
in the direction of n is called a normal derivative of 
f, denoted df/dn. From Theorem 6.2 of Chapter 2, 
we have 
of 
an 


(a) Let S denote the portion of the sphere x? + y? + 
z? =a’ in the first octant (i.e., where x > 0, 
y >0, z > 0), oriented by the unit normal that 
points away from the origin. Let f(x, y,z)= 
In (x? + y? + z’). Evaluate 


[[ Fas. 


(b) Let D denote the piece of the solid ball x? + y? + 


z? <a’ in the first octant; that is, 


Vfen. 


D = {(x, y, Z | xX +y +z <a’, 


x>0,y>20,z2>0} 
Compute /ff,V-(Vf)dV, where f is as in 
part (a). 

(c) Apply Gauss’s theorem to the integral in part (b), 
and reconcile your result with your answer in 
part (a). 

Suppose that f is such that for any closed, oriented 


surface S, 
f) 
f əf as =0. 
S ðn 


(See Exercise 25 for the definition of the normal deriva- 
tive 0f/dn.) Show that then 


3? a? a? 
f rm f g f 
əx? əy? Az? 


(i.e., that f is harmonic). 


Following Proposition 3.4, show that 
1 
div F(P) = lim sf ras, 
v>0 V S 


where S is a piecewise smooth, orientable, closed sur- 
face S enclosing a region D of volume V. (Take S to 
be oriented by outward normal.) The limiting process 
should be assumed to be such that D shrinks down to 
the point P. 


Use the result of Exercise 27 to establish the formula 
for the divergence ofa C! vector field 


F= Fi(x, y, zi + Px, y,z)j + B(x, y, z)k. 
That is, show that 


div F = + 


29. 
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in the following manner: Let P have coordinates 
(xo, yo, Zo) and consider the (small) cube S, of edge 
length a, centered at P with faces parallel to the coor- 
dinate planes. Note that the volume V enclosed by $ 
is a>. It will help to recall that if f(x, y, z) is differen- 
tiable, then 


af Ee f(x + Ax, y,z)— f(x, y, z) 
Ox Ax—>0 Ax 
= fee f (xt+ 4.9.2) — f(x — 4. y,z) 
Ax>0 Ax : 


In this problem, you will use the result of Exercise 27 to 
find an expression for V + F in cylindrical coordinates. 
(See Theorem 4.5 of Chapter 3.) Begin by writing 


F = F, e, + Fo eg + F; ez, 


where F,(r, 0, z), Fo(r, 0, z), and F,(r, 0, z) denote the 
components of F in the e,-, eg-, and e,-directions 
(respectively). Let P have cylindrical coordinates 
(r, 9, z). Consider the small “cylindrical coordinate 
cuboid” S shown in Figure 7.45. The pairs of oppo- 
site faces correspond to values 


r—Ar/2 and r+Ar/2; 
0 — A0/2 and 6+ A6/2; 
z—Az/2 and z+Az/2. 


Note that the volume of the cuboid is approximately 

r A0 Ar Az. 

(a) Approximate ff, F - dS (where S is oriented by out- 
ward unit normal) by noting that each face of S is 
roughly flat with an “obvious” unit normal vector 
and that F is approximately constant on each face. 


(b) Use your answer in part (a) to calculate the diver- 
gence in cylindrical coordinates as 


Figure 7.45 The cylindrical 
coordinate cuboid of 
Exercise 29. 
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30. Use the ideas of Exercises 27 and 29 to calculate the 
divergence in spherical coordinates. (See Theorem 4.6 
of Chapter 3.) You will want to make use of the small 
“spherical coordinate cuboid” S shown in Figure 7.46. 


x 


Figure 7.46 The spherical coordinate 
cuboid of Exercise 30. The volume of 
the cuboid is approximately 
p? sing A0 Ag Ap. 
The pairs of opposite faces of S correspond to values 


p—Ap/2 and p+Ap/2; 
g—Ag/2 and g+Ag/2; 
0 — A0/2 and 6+4A6@/2. 


31. Let F be a vector field of class C! in a neighborhood 
of the point P in R?, and let n be a unit vector drawn 
with its tail at P. Let C be a simple, closed curve such 
that there is an orientable surface S bounded by C that 
contains P and such that n is normal to S$ at P. Orient 
S by using n, and orient C consistently with S. Follow- 
ing Proposition 3.5, show that, if A denotes the area of 
S, then 


1 
n-curl F(P) = imz F -ds. 
= G 


Here the limiting process is assumed to be such that C 
shrinks down to the point P. (See Figure 7.47.) 


n 


P 


C 


Figure 7.47 Figure for 
Exercise 31. 
32. In this problem, you will use the result of Exercise 31 
to determine an expression for curl F in cylindrical co- 
ordinates. Begin by writing 


F = F, e + Fo eg + Fe. 


(a) Find the e,-component of curl F by considering 
the planar path shown in Figure 7.48. The pairs of 
opposite “edges” of the approximately rectangular 


Figure 7.48 The path C of 
Exercise 32(a). 


path C correspond to the values 

r—Ar/2 and r+Ar/2, 
and 

0 — A0/2 and 6+ A0/2 
(all with constant z-coordinate). Note that the area 
enclosed by C is approximately r A0 Ar. Approx- 
imate the line integral fe F - ds by using the fact 
that, for small A0 and Ar, each edge of C is 
roughly straight. Show that 


IF= Lote al TER) 
G oume =n rr A 
(b) Use the path in Figure 7.49 to show that 
1əF; a F, 
e, «curl F = ——— — =e) 
r 00 az 
z 
rA@ 
Az 
an 
y 


x 


Figure 7.49 The path C of 
Exercise 32(b). 


(c) Use the path in Figure 7.50 to show that 


OF, dF, 
eg -curl F = — —. 
Oz or 
Combine this with the results of parts (a) and (b) 


to obtain 
e, reg ez 

curl F = : d/dr d/d0  d/dz 

F, r Fo F, 


z 


(See Theorem 4.5 of Chapter 3.) 


Ar 


Az e 


x 


Figure 7.50 The path C of 
Exercise 32(c). 


33. In this problem, you will determine an expression for 


curl F in spherical coordinates. Let F be a vector field 
of class C!, and write 


F = Fp €p + Fy ey + Fo eg. 
Show that 


e, -curl F = 


eie 
— (sın E E 
me 


psing 


x 


and 


34. 
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wat) a Fp) 
€p ° cur = ; 
pes p sing 30 00 RR 

1f ə aF, 
eg «curl F = (p F,) e] 
: p E "ap 


by using Exercise 31 and the three paths shown in Fig- 
ure 7.51. Conclude that 


ep peg p Sing ey 
|F = 
cur Fry d/dp  3/ðğ 0/00 
F, p Fo psing Fo 


(See Theorem 4.6 of Chapter 3.) 


Of the six planar vector fields shown in Figure 7.52, 
four have zero divergence in the regions indicated and 
three have zero curl. By considering appropriate in- 
tegrals and using the results of Exercises 27 and 31, 
categorize each vector field. 


(d) 


Figure 7.52 Figures for Exercise 34. 
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7.4 Further Vector Analysis; Maxwell’s 
Equations 


In this section, we use Gauss’s theorem and Stokes’s theorem first to prove some 
abstract results in vector analysis and then to study Maxwell’s equations of elec- 
tricity and magnetism. 


Green’s Formulas 


Our purpose in $7.4 is to establish a few fundamental results of vector analysis. 
Throughout the discussion all scalar and vector fields are defined on subsets of R°. 
The following pair of results is established readily: 


THEOREM 4.1 (GREEN’S FIRST AND SECOND FORMULAS) Let f and g be scalar 
fields of class C?, and let D be a solid region in space, bounded by a piecewise 
smooth surface S = 0D, oriented as in Gauss’s theorem. Then we have 


Green’s first formula: 


[[[ve-veav+ [ff rveav = fh rve-as. 


Green’s second formula: 


JI [Ve -VA aV = (P F Vg -8P -as 


PROOF The first formula follows from Gauss’s theorem applied to the vector 
field F = f Vg. (We leave the details to you.) The second formula follows from 
writing the first formula twice: 


[[[vr-veav+ [ff sav = f rve-as: (1) 
[[[vevrav+ fff everav = ff evr-as. (2) 


Now, subtract equation (2) from equation (1). | 


The third of Green’s formulas requires considerably more work to prove. 


THEOREM 4.2 (GREEN’S THIRD FORMULA) Assume f is a function of class C°. 
Then, for 0D = S oriented as in Gauss’s theorem and points r in the interior 


of D, 
sO 
fo) = Ea k- 
1 B 1 _ VIO) 
xh A a). 


In this formula, dV denotes integration with respect to the variables in x = 
(x, y, zZ) G.e., r = (r1, r2, r3) is fixed in the integration), and the symbol V means 
Vx, differentiation with respect to x, y, and z. 


A proof of Theorem 4.2 appears in the addendum to this section. 
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An Inversion Formula for the Laplacian 


Green’s third formula is a type of inversion formula—a formula that enables 
us to recover the values of a function f by knowing certain integrals involving 
its gradient and Laplacian. Green’s third formula is mainly of technical interest 
in proving further results. We use it here to obtain an inversion formula for the 
Laplacian operator. 

We begin by applying the Laplacian V? to Green’s third formula: 


V2 f(x) 

2 2 ace 

Ve FS) = vl- JIR 
4 Vs f(x) 1 l 
ETAO sov (z) as| o 


The trick is to move v2 inside the surface integral, which is justified since x Æ r 
when x varies over S: 


Vx f(x) 1 
ege a Oa) 
f |r — x|| fœ) x|| 
= gv? (a = F009 (T : )) -dS 
s Ilr — xl] iir — x|| 


By direct calculation, V2(1/|lr — x||) = 0 for x Æ r, so 


m (=) 2 
"(r= x] 


Similarly, since f(x) does not involve r, 


v (seov. (g) = ro (i Sai) 


= f(x)Vx v(a : ) 
Ilr — x| 


= 0. 
Therefore, the Laplacian of the original surface integral is 0. We may conclude 
from equation (3) that, for r in the interior of D, 


Vif) =-ZV Ty, more ) 


and we have shown the following: 


THEOREM 4.3 Ifg = V’f for some function f of class C’, then for r in the 


interior of D, 
a= [Loa 


Theorem 4.3 provides an inversion formula for the Laplacian in the following 
sense: If V? f = o, then 


tom- a fff panty tO 


where g is any harmonic function (i.e., g is such that V?g = 0). That is, if the 
Laplacian of f is known, we can recover the function f itself, up to addition of 
a harmonic function. 
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Finally, it can be shown that the result of Theorem 4.3 holds under consider- 
ably less stringent hypotheses than having ¢ be the Laplacian of another function. ! 


Maxwell’s Equations 


Maxwell’s equations are fundamental results that govern the behavior of—and 
interactions between—electric and magnetic fields. We see how Maxwell’s equa- 
tions arise from a few simple physical principles coupled with the vector analysis 
discussed previously. 


Gauss’s law for electric fields If E is an electric field, then the flux of E 
across a closed surface S is 


Flux of E = fb E-dS. (4) 
s 


Applying Gauss’s theorem to formula (4), we find that 


Flux of = |f f vav, (5) 


where D is the region enclosed by S. 
If the electric field E is determined by a single point charge of q coulombs 
located at the origin, then E is given ir 


E(x) = (6) 


where x = x i + y j + z k. In mks units, E is measured in volts/meter. The constant 
€o is known as the permittivity of free space; its value (in mks units) is 


x 


ia Ixl?” 


8.854 x 107!? coulomb? /newton-meter?. 


For the electric field in equation (6), we can readily verify that V -E = 0 wherever 
E is defined. From formulas (4) and (5), if S is any surface that does not enclose 
the origin, then the flux of E across S is zero. 

But now a question arises: How do we calculate the flux of the electric field 
in equation (6) across surfaces that do enclose the origin? The trick is to find 
an appropriate way to exclude the origin from consideration. To that end, first 
suppose that S, denotes a sphere of radius b centered at the origin (i.e., Sp has 
equation x? + y? + z? = b?). Then the outward unit normal to Sp is 

_xityjt+zk |l 


x 
b b 


(See Figure 7.53.) From equation (6), 


q x 1 
E-dS= fb --xd5. 
f, Areo Ws, IXI? b 


On Sp, we have ||x|| = b, so that 


q x x q xl? 
E -dS = --dS= dS 
fh 4T €o fh b> b Ameo ITs, bt 


f= aoe fh 
= dS = dS 
4m Eo ft b4 Areqb* STs, 


= papate area of Sp) 
TE 


Figure 7.53 The sphere S, 
of radius b, centered at the 
origin, together with an 
outward unit normal vector. 2 

= 4r T Ey Pa aia 


' See, for example, O. D. Kellogg, Foundations of Potential Theory (Springer, 1928; reprinted by Dover 
Publications, 1954), p. 220. 


Figure 7.54 The solid region D is 
the region inside S and outside S}. 
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Now, suppose S is any surface enclosing the origin. Let S, be a small sphere 
centered at the origin and contained inside S. Let D be the solid region in R3 
between S, and S. (See Figure 7.54.) Note that V-E = 0 throughout D, since 
D does not contain the origin. (E is still defined as in equation (6).) Orienting 
dD = S U S, with normals that point away from D, we obtain 


fpe-as- ff E-ds = fh g-as= | V-EdV =0, 
s Sp aD D 


using Gauss’s theorem. We conclude that 


gpe-as= 4 (7) 
S €0 


for any surface that encloses the origin. By modifying equation (6) for E appro- 
priately, we can show that formula (7) also holds for any closed surface containing 
a single point charge of g coulombs located anywhere. (Note: The arguments just 
given hold for any inverse square law vector field F(x) = kx/||x||3, where k is a 
constant. See Exercise 13 in this section for details.) 

We can adapt the arguments just given to accommodate the case of n discrete 


point charges. For i = 1, ...,, suppose a point charge of q; coulombs is located 
at position r;. The electric field E for this configuration is 
1 x XxX—r;i 
E(x) = i r 8 
leer re Ra i 


i=l 


For E as given in equation (8), we can calculate that V- E = 0, except at x = r;. 
If S is any closed, piecewise smooth, outwardly-oriented surface containing the 
charges, then we may use Gauss’s theorem to find the flux of E across S by taking 
n small spheres $1, S:,..., Sn, each enclosing a single point charge. (See Fig- 
ure 7.55.) If D is the region inside S but outside all the spheres, we have, by 
choosing appropriate orientations and using Gauss’s theorem, 


peas- fp E-ds = fp E-as=[[f v-eav=o. 
8 i= W Si aD D 


since V- E = 0 on D. Hence, 
n 1 
E-dS= fp B-as = — qi 


1 
= — (total charge enclosed by S). (9) 
€0 


Figure 7.55 D is the solid region inside the 
surface S and outside the small spheres 
S1, So,..., Sn, each enclosing a point charge. 
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To establish Gauss’s law, consider the case not of an electric field determined 
by discrete point charges, but rather of one determined by a continuous charge 
distribution given by a charge density (x). The total charge over a region D in 


space is 
Jff pwav, 


so that, in place of formula (8), we have an electric field, 


o= zz | ff ro. (10) 


In equation (10), the integration occurs with respect to the variables in x. (Note: 
It is not at all obvious that the integral used to define E(r) converges at points 
r € D, where p(r) Æ 0, because at such points the triple integral in equation (10) 
is improper. See Exercise 20 in this section for an indication of how to deal with 
this issue.) 

The integral form of Gauss’s law, analogous to that of formula (9), is 


fresh [fon o 


where S = 0D. If we apply Gauss’s theorem to the left side of formula (11), we 


find that 
1 
Jff xxo =}]ff jav. 
D €0 D 


Since the region D is arbitrary, it may be “shrunk to a point.” From this, we 
conclude that 


Equation (12) is the differential form of Gauss’s law. 


Magnetic fields A moving charged particle generates a magnetic field. To be 
specific, if a point charge of q coulombs is at position rọ and is moving with 
velocity v, then the magnetic field it induces is 


B®) = es (ZE). (13) 


4r Ir — roll? 


In mks units, B is measured in teslas. The constant uo is known as the perme- 
ability of free space; in mks units 


uo = 4r x 1077 N/amp? © 1.257 x 1076 N/amp?. 


In the case of a magnetic field that arises from a continuous, charged medium 
(such as electric current moving through a wire), rather than from a single moving 
charge, we replace q by a suitable charge density function p and the velocity of 
a single particle by the velocity vector field v of the charges. Then we define the 
current density field J by 


I(x) = p(x)v(x). (14) 


Figure 7.56 The 
total current / 
across S is the flux 
of the current 
density J across S. 
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In place of formula (13), we use the following definition for the magnetic field 
resulting from moving charges in a region D in space: 


Bon = ff] owo a 
effo o 


In equation (15), as in equation (10), the integration is with respect to the vari- 
ables constituting x. As in equation (10), it is not obvious that the integrals in 
equation (15) are convergent if r € D, but, in fact, they are. (See Exercise 21 in 
this section.) 

Before continuing our calculations, we comment further regarding the current 
density field J. The vector field J at a point is such that its magnitude is the current 
per unit area at that point, and its direction is that of the current flow. It is not 
hard to see then that the total current 7 across an oriented surface S is given by 


the flux of J; that is, 
r= f [s-as. (16) 
S 
(See Figure 7.56.) 


Returning to the magnetic field B in equation (15), we show that it can be 
identified as the curl of another vector field A (to be determined). First, by direct 


calculation, 
y ( 1 ) r—x 
r = š 
Ilr — x|| Ir — x|| 


Therefore, equation (15) becomes 


all ee ff hpo (i oa) vs 


We rewrite equation (17) using the following standard (and readily verified) iden- 
tity, where f is a scalar field and F a vector field (both of class C°): 


V x (fF) =(V x F)f —Fx Vf. (18) 


Formula (18) is equivalent to 


FxVf=(VxF)f-Vx(fF). 


Therefore, 
1 l J 
I(x) x Ve (r) = (V; x I(x) gee 
|r — x|| |r — x |r — xl 
J(x) 
= — Vr X ; 
|r — xl] 


since J(x) is independent of r. Hence, 


=al] pea ae [hee 


as r does not contain any of the variables of integration. Consequently, 


B(r) = V x A(r), 
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Figure 7.57 The 
closed loop C is 
oriented so that it 
has a right-hand 
relationship with 
the direction of 
current flow it 
encloses. 


where 


an 2 hea a 


Thus, V -B = V -(V x A) and so, by Theorem 4.4 in Chapter 3, we conclude that 


The intuitive content of equation (20) is often expressed by saying that “magnetic 
monopoles” do not exist. 

The vector field A in equation (19) furnishes an example of a vector potential 
of the field B. (See Exercises 33—38 in the Miscellaneous Exercises for more about 
vector potentials.) 


Ampére’s circuital law If C is a closed loop enclosing a current 7, then 
Ampeéere’s law says that, up to a constant, the current through the loop is equal to 
the circulation of the magnetic field around C. To be precise, 


f B-ds = pol. (21) 
Cc 


In equation (21), we assume that C is oriented so that C and I are related by a 
right-hand rule, that is, that they are related in the same way that the orientation of 
C and the normal to any surface S that C bounds are related in Stokes’s theorem. 
(See Figure 7.57.) 

From equation (16) for the total current, equation (21) may be rewritten as 


f Beds=wo | f s-as, 
c s 


where S is any (piecewise smooth, oriented) surface bounded by C. Applying 
Stokes’s theorem to the line integral, we obtain 


[ |5 *B-as= m f f 3-as. 
S s 


Since the loop C and surface S are arbitrary, we conclude that 


Equation (22) is the differential form of Ampère’s law in the static case 
(i.e., in the case where B and E are constant in time). In the event that the magnetic 
and electric fields are time varying, we need to make some modifications. The so- 
called equation of continuity, established in Exercise 5 in this section, states that 

0p 
V-J=-— 23 
ae (23) 
The difficulty is that if V x B = uoJ as in equation (22), then equation (23) 
implies that 
dp 


V-(VxB)=V- (Hod) = —Ho7 


Figure 7.58 The 
rate of change of 
magnetic flux across 
S determines the 
electromotive force 
around the 
boundary C. 
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However, assuming B is of class C?, we must have V -(V x B) = 0, even in the 
case where p is not constant in time. 

The simplest solution to this difficulty is to modify equation (22) by adding 
an extra term. From Gauss’s law, equation (12), we must have 


Thus, if we replace J by J + €9(0E/dr) in equation (22), then we can verify that 
V «(V x B) = 0. (See Exercise 16 in this section.) Hence, Ampére’s law can be 
generalized as 


dE 
V x B= uoJ + uoco -— 


ət 


The term co(9E/3t) in equation (24), known as the displacement current density, 
was first postulated by James Clerk Maxwell in order to generalize Ampère’s law 
to the nonstatic case. (In this context, the original current density field J is known 
as the conduction current density.) 

Equation (24) is not the only possible generalization of equation (22), but 
it is the simplest one and is consistent with observation. See Exercise 17 in this 
section for other ways to generalize equation (22) to the nonstatic case. 


Faraday's law of induction Michael Faraday observed empirically that the 
change in magnetic flux across a surface S equals the electromotive force around 
the boundary C of the surface. This relation can be written as 


d® 
OO = sds: 25 
< $ s (25) 


where &(t) = Sf. s B- dS, and C and S are oriented consistently. (See Figure 7.58.) 
If we apply Stokes’s theorem to the line integral, we find that 


f £-as= ff v xe-as. 
C S 


d® d 0B 
— = — B-dS= — dS, 
dt off II ot 


Since 


we have 


B 
-f 8 as= ff vx E-as. (26) 
s Ot s 


Because equation (26) holds for arbitrary surfaces, we conclude that 
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Figure 7.59 Ifx 
is any point on the 
surface of the 
sphere of radius b 
centered at r, then 
lr — xl| = b. 


Summary Equations (12), (20), (24), and (27) together are known as Maxwell’s 
equations: 


(Gauss’s law); 
(No magnetic monopoles); 


(Faraday’s law); 


dE 
Vx B = uoJ + Hoco (Ampère’s law). 


Maxwell’s equations allow us to reconstruct the electric and magnetic fields from 
the charge and current densities. They are fundamental to the subject of electricity 
and magnetism and provide a fitting tribute to the power of the theorems of Stokes 
and Gauss. 


Addendum: Proof of Theorem 4.2 
The most obvious idea is to use Green’s second formula with 


= a 


However, this function fails to be continuous when x = r, so Gauss’s theorem 
(and hence Green’s formula) cannot be applied so readily. Instead, we need to 
examine the integrals more carefully. 

Throughout the discussion that follows, let S, denote the sphere of radius b 
centered at r. First, we establish some subsidiary results. 


E Lemma 1 Ifh is a continuous function, then 


h 
lim fb ©) as =o 
b>0 JJs, llr — xl] 


PROOF The average value of h on S, is defined to be 


ff, ney as l 
hliye = 2 h(x)dS. 
[I love $ (x) 


surface area of Sp 4mb? 


(See Exercise 9 of the Miscellaneous Exercises.) Thus, 
fp h(x) dS = 4b? [h]ag. 
Sp 


As x varies over the surface Sp, we have ||r — x|| = b. (See Figure 7.59.) Hence, 


h(x) ; 1 , 
lim [iA =i dS = lim f zrs = lim 476 [AJavg = 0. | 


To clarify the variables with respect to which we differentiate, let Vx denote 
the del operator with respect to x, y, and z, and Vy the del operator with respect 
tor = (r1, r2, r3). 


Figure 7.60 The region 

D — B denotes the solid D 
with a small ball centered at r 
removed. 
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m= Lemma 2 With h and S, as in Lemma 1, 


1 
lim ff h(x)Vx () «dS = —4rh(r). 
b>0 JJ s, lr — xl] 


PROOF Let n = (x — r)/||r — x||, the normalization of x — r. Straightforward 


calculations yield 
v, ( 1 ) _ x-r . 
Ilr — xl Ir — xl? 
and 


nva 1 )= (x—r)-(-r) Ix- rl? _ 1 _ 1 


r—x|| Ir — xlt Ir — xlt Ir — xl? b?’ 


for x on Sp. Then 


1 1 
lim dp h(x)Vx (a dS = lim (rov, [=-= ) n) dS 
0 JJ's, ilr — x|| b>0 JJS, |r — x|| 


b> 


; 1 2 
= lim -7 (4xb [A]avg) 
= —4rh(r). 
(See the proof of Lemma 1.) a 


Returning to the proof of Green’s third formula, we look at a region to which 
we can apply Green’s second formula, namely, the region D — B, where B is a 
small ball of radius b centered at r. (See Figure 7.60.) By Green’s second formula 
(since 1/||r — x|| is not singular on D — B), we have 


fff (20°? Goa) - ea” 
D-B lr — x| |r — x|| 
= l / ( IOV ( ! ) Vit =) dS. (28) 
S-S; |r — x|| |r — x|] 


By direct calculation vel /|\|r — x||) = 0, so equation (28) becomes 
TT NfO] 
p-g |r- x4 
= fp ( EA ( : ) a =) -dS. (29) 
S-S, lr — xl] Ilr — x|| 


We may evaluate the right-hand side of equation (29) by replacing the surface 
integral over S — S, by separate integrals over S and S,. Now, we take limits as 
b — 0. By Lemma 1 with h(x) = Vx f(x)-n 


fb Vx F(X) -as= fp ULLS a E 
s, lr — xll s, lle — xl] 


sov ( : )-as > —4r f (r). 
Ir — x|| 


By Lemma 2, 


Sp 
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Since B shrinks to a point as b — 0, we see that equation (29) becomes 


V2 1 Vx 
pE- ff (00% (eta) BL) ess, 
lr — x| s lr — xl lr — xl] 
from which A s third formula follows immediately. E 


7.4 Exercises 


1. Prove Green’s first formula, stated in Theorem 4.1. 


A function g(x, y,z) is said to be harmonic at a point 
(xo, Yo. Zo) if g is of class C and satisfies Laplace's equation 


vp 28, #8 a°g 


mad =0 
əx? əy? Az? 


on some neighborhood of (xo, yo, zo). We say that g is har- 
monic on a closed region D C R? if it is harmonic at all in- 
terior points of D (i.e., not necessarily on the boundary of 
D). Exercises 2—4 concern some elementary vector analysis of 
harmonic functions. 


2. Assume that D is closed and bounded and that 3 D is a 
piecewise smooth surface oriented by outward unit nor- 
mal field n. Let 3g /ðn denote Vg - n. (The term ðg/ðn 
is called the normal derivative of g.) Use Green’s first 
formula with f(x, y,z) = 1 to show that, if g is har- 


monic on D, then 
ð 
f dea: 
aD on 


3. Let f be harmonic on a region D that satisfies the 
assumptions of Exercise 2. 


(a) Show that 


[[[vt-vrav = j sžas. 


(b) Suppose f(x, y,z)=0 for all (x,y,z) € 0D. 
Use part (a) to show that then we must have 
f(x,y,z) = 0 throughout all of D. (Hint: Think 
about the sign of Vf + Vf.) 


4. Let D be a region that satisfies the assumptions of 
Exercise 2. Use the result of Exercise 3(b) to show 
that if fi and f2 are harmonic on D and fi(x, y, z) = 
fa(x, y, z) on OD, then, in fact, fı = fo on all of D. 
Thus, we see that harmonic functions are determined 
by their boundary values on a region. (Hint: Consider 


fi -— fo-) 


5. (a) Suppose a fluid of density p(x, y, z, t) flows with 
velocity field F(x, y, z, t) in a solid region W in 
space enclosed by a smooth surface S. Use Gauss’s 
theorem to show that, if there are no sources or 
sinks, 


This equation is called the continuity equa- 
tion in fluid dynamics. (Hint: The triple integral 
Sify %2 dV is the rate of fluid flowing into W, 
and the flux of pF across S gives the rate of fluid 
flowing out of W.) 

(b) Use the argument in part (a) to establish the equa- 
tion of continuity for current densities given in 
equation (23): 


Let T(x, y, z, t) denote the temperature at the point (x, y, Z) 
of a solid object D at time t. We define the heat flux density H 
by H = —kVT. (The constant k is the thermal conductivity. 
Note that the symbol V denotes differentiation with respect to 
x, y, Z, not with respect to t.) The vector field H represents the 
velocity of heat flow in D. It is a fact from physics that the total 
heat contained in a solid body D having density p and specific 


heat o is 
J / / söt dv. 
D 


Hence, the total amount of heat leaving D per unit time is 


J) Poet? 
OO : 


(Here we assume that o and p do not depend on t.) We also 
know that the heat flux may be calculated as 


J| Has. 


Exercises 6—10 concern these notions of temperature, heat, and 
heat flux density. 


6. Use Gauss’s theorem to derive the heat equation, 
oT 
— =kv’T. 
oe ot 
7. In Exercise 6, suppose that k varies with the points in 
D; that is, k = k(x, y, z). Show that then we have 
oT 
A = kV°T + Vk- VT. 


8. In the heat equation of Exercise 6, suppose that o, p, 
and k are all constant and the temperature T of the 
solid D does not vary with time. Show that then T 
must be harmonic, that is, that V?T = 0 at all points 
in the interior of D. 


9. 


10. 


11. 


(a) Ifo, p, and k are constant and the temperature T 
of the solid D is independent of time, show that 
the (net) heat flux of H across the boundary of D 
must be zero. 


(b) Let D be the solid region between two concentric 
spheres of radii 1 and 2. Suppose that the inner 
sphere is heated to 120° C and the outer sphere to 
20° C. Use the result of part (a) to describe the rate 
of heat flow across the spheres. 


Consider the three-dimensional heat equation 


ou 


Vu = — 
ot 


(30) 
for functions u(x, y,z,f). (Here V7u denotes the 
Laplacian 07u/dx? + d7u/dy? + d7u/dz7.) In this ex- 
ercise, show that any solution T(x, y, z, t) to the heat 
equation is unique in the following sense: Let D be a 
bounded solid region in R? and suppose that the func- 
tions a(x, y, z) and @(x, y, z, t) are given. Then there 
exists a unique solution T(x, y, z, t) to equation (30) 
that satisfies the conditions 


T(x, y,z,9)=a(x, y,z) for(x,y,z)€ D, 
(31) 


and 


for (x, y,z)€ ƏD 
andt > 0. 


T(x, y,Z,t) = (x, y, z, t) 


To establish uniqueness, let T; and T, be two solutions 

to equation (30) satisfying the conditions in (31) and 

set w = T, — Th. 

(a) Show that w must also satisfy equation (30), plus 
the conditions that 


w(x, y,z,0)=0 forall (x, y,z) € D, 
and 
w(x, y,z,t)=0 forall (x, y,z) € dD andt > 0. 
(b) For t > 0, define the “energy function” 


_! 2 
E(t)= s/f ples Ys 2 dV. 


Use Green’s first formula in Theorem 4.1 to show 
that E’(t) < 0 (i.e., that E does not increase with 
time). 

(c) Show that E(t) = 0 for all t > 0. (Hint: Show that 
E(0) = 0 and use part (b).) 

(d) Show that w(x, y,z,t)=0 for all t> 0 and 
(x, y,z) € D, and thereby conclude the unique- 
ness of solutions to equation (30) that satisfy the 
conditions in (31). 


Show that Ampére’s law and Gauss’s law imply the 
continuity equation for J. (Note: In the text, we use the 
continuity equation to derive Ampère’s law.) 


12. 


13. 


14. 


15. 
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Suppose that E is an electric field, in particular, a vector 
field that satisfies the equation V - E = p/eo. A region 
D in space is said to be charge-free if p is zero at 
all points of D. Describe the charge-free regions of 
E = (x? — x)i + ty? j + ($2? — 2z) k. 

By considering the derivation of Gauss’s law for elec- 
tric fields, show that, for any inverse square vector 
field F(x) = kx/||x||?, the flux of F across a piecewise 
smooth, closed, oriented surface S is 


0 
F-dS= 
fh S ia 


Leta point charge ofq coulombs be placed at the origin. 
Recover the formula 


if S does not enclose the origin, 
if S encloses the origin. 


q X 
© 4reo |ixl? 

by using Gauss’s law in the following way: 

(a) First, explain that in spherical coordinates, E(x) = 
E(x)e,, that is, that E has no components in either 
the e,- or eg-direction. Next, note that E(x) may 
be written as E(p)—that is, that ||E || has the same 
magnitude at all points on a sphere centered at the 
origin. 


(b) Show, using Gauss’s law and Gauss’s theorem, that 


fp E(p)e, dS = £, 
S €0 


where S is any smooth, closed surface enclosing 
the origin. 


(c) Now let S be the sphere of radius a centered at 
the origin. Then the outward unit normal n to S at 
(p, , @) is ep. Show that 


fp E(p)ds = L. 
S €0 


(d) Use part (c) to show that E(p) = q /(47 €0p°). 
Conclude the result desired. 


(a) Establish the following identity for vector fields F 
of class C?: 
V x (V x F) = V(V - F) — V’F. 
(Note: V?F = (V » V)F.) 
(b) In free space (i.e., in the absence of all charges and 


currents), use Maxwell’s equations to show that E 
and B satisfy the wave equation 


a°F 
VF =k—., 
ar? 
where k is a constant. What is k in each case? 
(c) Use part (a), Faraday’s law, and Ampère’s law to 
show that 


0 OE 
V(V -E)—(V°- V)E = —uo— |J ——ll\ 
(V-E)- (V-V) wo [itea] 
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(d) Show that, in the absence of any charges (i.e., if 
p = 0), 


Verify that if the nonstatic version of Ampére’s law 
(equation (24)) holds, then V -(V x B) = 0. 


When Maxwell postulated the existence of displace- 
ment currents to arrive at a nonstatic version of 
Ampeéere’s law, he was simply choosing the simplest 
way to correct equation (22) so that it would be consis- 
tent with the continuity equation (23). However, other 
possibilities are also consistent with the continuity 
equation. 

(a) Show that in order to have equation (22) valid in 

the static case, then, in general, we must have 
oF, 


V x B= uoj + — 
Ho +- 


for some (time-varying) vector field F; of class C?. 


(b) By taking the divergence of both sides of the equa- 
tion in part (a), show that 
OF, JE 
a = HLoeoV + ae! 
(c) Use part (b) to argue that, from an entirely math- 
ematical perspective, Ampére’s law can also be 
generalized as 


JE 
VxB= Hod + Moeo—— + Fy, 


where F; is any divergence-free vector field. Since 
no one has observed any physical evidence for F2’s 
being nonzero, it is assumed to be zero, as in equa- 
tion (24). 


Suppose that J = oE. (This is a version of Ohm’s law 
that obtains in some electric conductors—here o is a 
positive constant known as the conductivity.) If o = 0, 
show that E and B satisfy the so-called telegrapher’s 
equation, 

°F 


VF di + — 
a a= a e 
Lo at Lo€o ar 


The function X: R? —> R? given by X(s, t) = (2s + 
3t +1,4s —t,s +2t— 7) parametrizes the plane 
9x — y — 14z = 107. 


The function X: R? —> R? given by X(s, t) = (s? + 
3t — 1, s? +3, —2s2 + t) parametrizes the plane x — 
Ty —3z+22=0. 


19. 


20. 


21. 


Let E and B be steady-state electric and magnetic fields 
(i.e., E and B are constant in time). The Poynting vec- 
tor field P = E x B represents radiation flux density. 
Use Maxwell’s equations to show that, for a smooth, 
orientable, closed surface S bounding a solid region D, 


fpr -ds=-v fff e-sav. 


Consider the electric field E(r) defined by equation 

(10). Note that the integrals in equation (10) are im- 

proper in the sense that they become infinite at points 

r € D, where p(r) is nonzero. In this exercise, you will 

show that, nonetheless, the integrals in equation (10) 

converge when D is a bounded region in R? and p isa 

continuous charge density function on D. 

(a) Write E(r) in terms of triple integrals for the in- 
dividual components. Let r = (r1, r2, r3) and x = 
(x, y, z). 

(b) Show that if each component of E is written in the 
form fff fœŒdV, then | f(x)| < K/|lr— x’, 
where K is a positive constant. 


(c) It follows from part (b) that if 


JJl 


converges, so must f ffp f(x) dV. Show that 


JJ], m” 


converges by considering an iterated integral in 
spherical coordinates with origin at r. (Hint: Look 
carefully at the integrand in spherical coordinates.) 


Consider the magnetic field B(r) defined by equation 
(15). As was the case with the electric field in equation 
(10), itis not obvious that the integrals in (15) converge 
at all r € D. Follow the ideas of Exercise 20 to show 
that B(r) is, in fact, well-defined at all r, assuming a 
continuous current density field J and bounded region 
D in R°. 


TT Pi 


. The function X:(—0o, 00) x (—2, 2) > R? given 


24 2 
by X(s,t) = (s? +3tant — 1,59 +3, —2s3 + tant) 
parametrizes the plane x — 7y — 3z + 22 = 0. 


. The surface X(s, t) = (s°t, st?, st) is smooth. 


10. 


11. 


12. 


13. 


14. 


15. 
16. 
17. 
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. The area of the portion of the surface z = xe*” lying 


over the disk of radius 2 centered at the origin is given 
by 


2 pV4-x* 
| f Jl +e% (x4 + x?y? +2xy + l)dydx. 
o Jo 


. If S is the unit sphere centered at the origin, then 


if dS =0, 


. If S is the cube with the eight vertices (+1, +1, +1), 


then ff, + x7 y)dS = 0. 


. If S denotes the rectangular box with faces given 


by the planes x =+l1, y=+2, z= 343, 
[{exyzdS = 0. 


then 


. If S denotes the sphere of radius a centered at the 


origin, then 


[[@-e+nas=[fo—ss +s, 


Jfs(-yi+ xj)-dS =0, where S is the cylinder 
x? +y? =9,0<z<5. 

Let S$ denote the closed cylinder with lateral sur- 
face given by y? + z? = 4, front by x = 7, and back 
by x = —1, and oriented by outward normals. Then 
foxi. dS = 24r. 


If S is the portion of the cylinder x+y = 16, 
—2 <z < 7, then ffy V x (yi)-dS = 0. 


[fs F -dS = 6r, where S is the closed hemisphere 
x? +y? +z? = 1, z > 0, together with the surface 
x? +y? < 1,z = 0 and F = yzi — xzi + 3k. 

If S is the level set ofa function f(x, y, z)and V f 4 0, 
then the flux of V f across S is never zero. 


A smooth surface has at most two orientations. 
A smooth, connected surface is always orientable. 


If F is a vector field of class C! and S is the ellipsoid 
x? + dy? + 92? = 36, then ff, V xF -dS = 0. 


Jf; V x F+dS has the same value for all piecewise 
smooth, oriented surfaces S that have the same bound- 
ary curve C. 


Figure 7.61 shows the plots of six parametrized sur- 
faces X. Match each parametric description with the 
correct graph. 

(a) X(s, t) = (t(s? — t), s, — t°) 

(b) X(s, t) = (s cost, s sint, s) 


(c) X(s, t) = ((2 + cos s) cost, (2 + cos s) sint, t + sins) 


(d) X(s, t) = (sins cost, s, sin s sin t) 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 
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IfF is a constant vector field, then fs F -+ dS = 0, where 
S is any piecewise smooth, closed, orientable surface. 


fs V x F-dS = 0, where S is any closed, orientable, 
smooth surface in R? and F is of class C!. 


Suppose that F is a vector field of class C! whose do- 
main contains the solid region D in R? and is such that 
|| F(x, y, z)|| < 2 at all points on the boundary surface 
S of D. Then [ffp V -FdV is twice the surface area 
of S. 


If S is an orientable, piecewise smooth surface and F 
is a vector field of class C! that is everywhere tangent 
to the boundary of S, then ff, V x F-dS = 0. 


If S is an orientable, piecewise smooth surface and F 
is a vector field of class C! that is everywhere perpen- 
dicular to the boundary of S, then th VxF-dS=0. 


If F is tangent to a closed surface S that bounds a solid 
region D in R°, then [ffp V-FdV = 0. 


Let S be a piecewise smooth, orientable surface and 
F a vector field of class C!. Then the flux of F across 
S is equal to the circulation of F around the boundary 
of S. 


Let D be a solid region in R? and F a vector field of 
class C!. Then the flux of F across the boundary of D 
is equal to the integral of the divergence of F over D. 


Suppose that f and g are of class C? and D is a solid 
region in R? with piecewise smooth boundary surface 
S that is oriented away from D. If g is harmonic, then 


Ilo Vf -Vgdv = ff, fVg-ds. 


Suppose that f and g are of class C? and D isa solid 
region in R? with piecewise smooth boundary sur- 
face S that is oriented away from D. If f and g are 
harmonic, then ff, fVg-dS = — fy 8V f + dS. 


If V? f is known, then f is uniquely determined up to 
a constant. 


If S is a closed, orientable surface, then 


bos -dS=0 
s Ixl? 


(e) X(s, t) = (cos: + 0.1t (cosrcoss + 


| oo t ; ; 
a sins sins) , SIN É +0.1t (sinr coss 


V5 
1 E ) t 0.21 , ) 
— — posi SINS J; = — sins 
V5 2 V5 


(£) X(s, t) = (s cost, s sint, t) 
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2. Consider the unit sphere S with equation x? + y? + 
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Figure 7.61 Figures for Exercise 1. 


(b) Now define X: R? —> R? by letting X(s, t) be the 
point of intersection of § and the line joining 
(s,t,0) and (0, 0,1). Write a set of parametric 
equations for the line joining (s, t, 0) and (0, 0, 1) 
and use it to give a formula for X(s, t). 


z? = 1. Inthis problem, you will provide a parametriza- 
tion for (almost all of) the sphere that is different from 
the one given in Example 2 of §7.1. 


(a) First consider the parametrized plane in R°: 


D = {(s, t, 0) | (s,t) € RÈ}. (c) Show that X(s, t) is a smooth parametrization of 

almost all of S. What points of S are not included 

Note that D is just a copy of R? sitting in R3 in the image of X? For evident geometric reasons, 

as the xy-plane. For any point (s, t,0) € D, ar- the map X defined in this problem has an inverse 

gue geometrically that the line through (s, t, 0) and map from (almost all of) the sphere to the xy-plane, 

(0, 0, 1) intersects S in a point other than (0, 0, 1). called stereographic projection of the sphere onto 
(See Figure 7.62.) the plane. 


3. (a) Provide a parametrization for the hyperboloid x? + 
y? — z? = 1. (Hint: Use the cylindrical coordinates 
z and @ for parameters.) 


(0, 0, 1) 
s (b) Modify your answer in part (a) to give a para- 
metrization of the hyperboloid 


xX 


a2 


(c) Let xo and yo be such that Xo + yo = 1. Show that 
the lines 


Figure 7.62 Figure for Exercise 2. l(t) = (a(xo — yot), b(xot + yo), ct) 


and 


l(t) = (a(yot + xo), b(Vo — xot), ct) 


lie in the hyperboloid of part (b). 


(d) Show that the lines l; and l of part (c) also lie in 
the plane tangent to the hyperboloid at the point 
(axo, byo, 0). 


. Find the surface area of the portion of the hyperboloid 
x? +y? — z? =1 between z = —a and z =a. (See 
Exercise 3(a).) 


. (a) Parametrize the ellipsoid 


2 yy z 
z T p F z= i 
(b) Use your answer in part (a) to set up an integral 
for the surface area of the ellipsoid. Do not evaluate 
this integral, but verify that it indicates correctly 
the surface area in the case that the ellipsoid is a 
sphere of radius a. 


. Let y = f(x), a < x < b, be a curve in the xy-plane. 

Suppose this curve is revolved around the x-axis to 

generate a surface of revolution. 

(a) Explain why X(s,t)= (s, f(s)cost, f(s)sint) 
parametrizes the surface so described. (Hint: Con- 
sider the t-coordinate curve.) 


(b) Verify that the area of the surface is 


b 
on f IFTE OF dx. 


. Let the curve y = f(x), 0 <a <x < b, be revolved 
around the y-axis to generate a surface. 


(a) Find a parametrization for the surface. 
(b) Verify that the area of the surface is 


b 
ax f xy 1 +F dx. 


. Let S denote the surface defined by the equation 
z= f(x), a <x <b. (The surface is a generalized 
cylinder over the curve z = f(x) in the xz-plane.) Let 
C denote a piecewise C!, simple, closed curve in the 
xy-plane. Let D denote the region in the xy-plane 
bounded by C and assume that every point of D has x- 
coordinate between a and b. Let Sı denote the portion 
of S lying over D. 


(a) Show that the portion of S lying over D is 


| i s(x) dA, 


where s(x) = f> /1 + (f/(t)) dt; that is, s is the 


arclength function of the curve z = f(x). 
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(b) Use part (a) to show that the surface area may also 
be calculated from the line integral 


§ s(x) dy, 
C 


where C is oriented counterclockwise. 
(c) Compute the surface area of the portion of 


x 1 


C= Lee eS, 


lying over the rectangle in the x y-plane having ver- 
tices (1, +2), (3, +2). 


Let f be a piecewise smooth surface in R? and f: X C RÈ > 
Ra scalar-valued function whose domain X contains S. Then 
the average value of f on S is the quantity 


“pg hatas _ Ila t as 
H Sfzds "area of S` 


Exercises 9-11 involve the notion of the average value of a 


function on a surface. 


9. (a) Explain why the definition of the average value 
makes sense. 


(b) Suppose that the temperature at points on the 
sphere x? + y? + z? = 49 is given by T(x, y, z) = 
x? + y? — 3z. Find the average temperature. 
10. Find the average value of f(x, y, z) = x?e — y*z on 
the cylinder x? + y? =4,0<z <3. 
11. Find the average value of f(x, y, z) = x? + y? — 3 on 
the portion of the cone z? = 4x? + 4y?, —2 < z <6. 
12. A thin film is made in the shape of the helicoid 
X(s, t) = (s cost, s sint,t), 0<s<1,0<tx<4r. 


Suppose that the mass density (per unit area) at each 
point (x, y, z) of the film varies as 


5(x, y, Z) = y x? + y?. 
Find the total mass of the film. 


Let S be a piecewise smooth surface in RÌ. Suppose that the 
mass density at points (x, y, z) of S is 6(x, y, z). Using formu- 
las analogous to those in §5.6, we define the (first) moments 


of S to be 
| | beds. / | yô(x, y, 2dS, 
Ss S 
aii / f 28(x, y, 248. 
6 


Exercises 13—16 involve first moments and centers of mass of 
surfaces. 


13. Find the center of mass of the first octant portion of the 
sphere 
P+y42 =a’, 


assuming constant density. 


526 


Chapter 7 | Surface Integrals and Vector Analysis 


14. Find the center of mass of the piece of the cylinder 


r+ =a’, O0O<y<a,z>0. 


Assume that the density of the surface is constant. 
15. Find the center of mass of a sphere of radius a, where 


the density 5 varies as the square of the distance from 


the “south pole” of the sphere. 
16. Find the center of mass of the cylinder x? + z? = a’, 
between y= 0 and y = 2, if the density varies as 


=x +y. 


Given a piecewise smooth surface S, the moment of inertia 1, 
of S about the z-axis is defined by the surface integral 


I = Jfe + yd dS, 
S 


where ô(x, y, z) is mass density. The corresponding radius of 
gyration about the z-axis is given by 


I, 
r: =| —, 


M 


where M = [[,5dS. Likewise, the moments of inertia of S 
about the x- and y-axes are given by, respectively, the surface 
integrals 


L = [ [o+ asas andy = f [œ+ asas. 
S S 


(Compare these formulas with the ones in §5.6.) Exercises 17— 
19 concern moments of inertia and radii of gyration of surfaces. 


17. (a) Calculate /, for the surface S cut from the cone 
z2 = 4x? 4 dy? 


by the planes z = 2 and z = 4. (This surface is 
known as a frustum of the cone.) Assume density 
is equal to 1. 

(b) Find the radius of gyration r, of the frustum. 

(c) Repeat parts (a) and (b), assuming that the density 
at a point is proportional to the distance from that 
point to the axis of the cone. 


18. Let S denote the cylindrical surface with equation 
x? + y? = a°, where —b < z < b (a, b positive con- 
stants). Assume that the density ô is constant along 
S. 


(a) Find the moment of inertia of S about the z-axis. 
(b) Find the radius of gyration of S about the z-axis. 
19. Let S be as in Exercise 18. 


(a) Find the moments of inertia J, and J, of S about 
the x- and y-axes. 


(b) Find the radii of gyration ry and ry. 


20. (a) Prove the following mean value theorem for double 
integrals: If f and g are continuous on a compact, 


21. 


22. 


23. 


24. 


25. 


26. 


connected region D C R?, then there is some point 
P € D such that 


J| teaa=sre ff gaa 


(Hint: Consider the ratio ffp fgdA/JfpgdA 
and use the intermediate value theorem.) 

(b) Use the result of part (a) to prove the following: 
Let S be a smooth surface oriented by unit normal 
n and let F be a continuous vector field on S. As- 
sume that S may be parametrized by a single map 
X: D — RÈ. Then there is some point P € S such 
that 


Tf F-dS = [F(P)- n(P)](area of S). 
s 


Let a be a constant vector and C a smooth, simple, 
closed curve. Show that f, a + ds = 0 in two ways: 


(a) directly; 
(b) by assuming that C is the boundary of a smooth 
surface S. 


Evaluate §.(x? + 2?)dx + ydy + zdz, where C is 
the closed curve parametrized by the path x(t) = 
(cost, sint, cos? t — sin? t). 


Let f and g be functions of class C?, and let S be a 
piecewise smooth, orientable surface. Show that 


f crve)-as= ffos x Vg). dS. 


Let f and g be functions of class C?, and let S be a 
piecewise smooth, orientable surface. Show that 


P (199+ 8Vf)-ds=0. 


(Hint: Use Exercise 23.) 


Let f be of class C?, and let S be a piecewise smooth, 
orientable surface. Show that 


§ (fV f)+ds = 0. 
os 


Let C be a simple, closed, piecewise C! planar curve 
in R°. That is, C is contained in some plane in R°. Let 
n=ai+bj-+ck denote a unit vector normal to the 
plane containing C, and let C be oriented by a right- 
hand rule with respect to n. 


(a) Show that 
1 
z § (bz — cy)dx + (cx — az)dy + (ay — bx)dz 
Cc 
= area enclosed by C. 


(b) Now show that the result of part (a) reduces to 
something familiar in the case that C is a curve in 
the xy-plane. (See Example 2 of §6.2.) 


27. Suppose S is a piecewise smooth, orientable surface 
with boundary 0S. Use Faraday’s law to show that if E 
is everywhere perpendicular to 0S, then the magnetic 
flux induced by E does not vary with time. 


28. Let G be a vector field of class C?. Then by Theorem 
4.4 in Chapter 3, V -(V x G) = 0. Therefore, Gauss’s 
theorem implies that 


[[oxe-as=[[ [v7 xeav =o. 


Then Stokes’s theorem yields 


o= | fv x6)-as= G-ds. 
Ss 0s 


Hence, all vector fields G of class C? are conservative. 
How can this be? 


Recall that we measure an angle in radians as follows: Place 
the vertex of the angle at the center O of a circle of radius a, 
so that the angle subtends an arc of the circle of length s. Then 
the measure of the angle in radians is 


g=-. 
a 
We can do something similar in the three-dimensional case by 
defining a solid angle as the set of rays beginning at the center 
O of a sphere of radius a, so that the rays cut out a portion 
of the sphere having surface area A. Then the measure of the 
solid angle in steradians is 


gaí 
= 


(See Figure 7.63.) Thus, the solid angle of the entire sphere of 
radius a is 4T. 


Figure 7.63 A solid angle 
measured on a sphere. 


Now suppose that S is a smooth, oriented surface and that a 
point O in RÈ is chosen that is not in S and so that every line 
through O intersects S at most once. In this case, we define 
the solid angle relative to O subtended by S as the set of rays 
beginning at O that pass through a point of S. We measure this 
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solid angle by calculating 


x 
as,0)= f| as. (1) 


where x denotes the (varying) vector from O to a point P in S 
and S is oriented by a normal that points away from O. 
Exercises 29-32 develop some ideas regarding solid angles. 


29. In this problem we’ll see how the measure of the 
solid angle given in equation (1) can be related to the 
more geometric notion of measuring solid angles using 
spheres. To that end, in the situation described above, 
construct a sphere S, of radius a centered at O. Let Sy 
denote the intersection of Są and the solid angle of S$ 
relative to O. (See Figure 7.64.) By applying Gauss’s 
theorem to the solid region W between S and Š, show 
that 


surface area of S, 
2 


Q(S, 0) = 


Figure 7.64 A solid angle subtended by a 
surface S. 


30. If S is parametrized by X: D C R? => R, and O de- 
notes the origin in R?, we may use equation (1) to define 
the measure of the solid angle subtended by S for very 
general surfaces, including ones that may have some 
self-intersections. Show that for such a parametrized 
surface Q(S, O) may be calculated as 


1 
Q(S, o= ff @+y4+ aR 


x y z 
əx dy əz 

x det} 3s ds das |dsdt. 
ax dy əz 
Ot at at 


31. Suppose that S is closed, oriented, and forms the bound- 
ary of a solid, bounded, simply-connected region W in 
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R°. Using equation (1) to define the measure of the 
solid angle, show that 


+4r 
0 if S does not enclose O. 


if S encloses O 
Q(S, 0) = 


Suppose that S is the circular disk of radius a in the xy- 
plane and centered at the origin. Let O be a moving 
point along the z-axis and denote it by (0, 0, z). Use 
equation (1) to show that —2m < Q(S, O) < 27. In 
addition, show that Q(S, O) jumps by 47 as O passes 
through S. 


Prove the following: Let F be a vector field of class 
C! defined on R?. If V-F = 0, then there is some 
vector field G of class C! such that F = V x G. This 
result provides a converse to Theorem 4.4 of Chapter 3. 
(Hint: Let 


1 
G(x, y,z)= f tF(xt, yt, zt) x rdt, 
0 


where r=xi+yjt+zk. Show VxG=F. The 
identities 


V x (A x B) = AV-B-BV-A 
+(B-V)A— (A- V)B, 
d 
g Eh yt, zt)] = (r° V)E(xt, yt, zt) 
+E(xt, yt, zt), 


and 


ka [P Fet, yt, zt)] =t f ma yt, zt)] 
dt = dt Egs 


+ F(xt, yt, an) 


will prove useful. Also, let X 
and note that, by the chain rule, 
OF OF OX _ ; oF 
dx aX dx ax 


xt, Y= yt, Z= zt 


> 


etc.) 


The vector field G defined in Exercise 33 is called a vector 
potential for the vector field F. In Exercises 34-36, determine 
a vector potential for the given vector field or explain why such 
a potential fails to exist. 


34. 
35. 
36. 
37. 


F = 2xi-yj-—zk 
F=xi+yjt+zk 
F = 3yi + 2xzj — 7x°yk 


The vector potential G identified in Exercise 33 is not 
unique. In fact, show that if G is a vector potential for F, 
then so is G + V@, where ¢ is any scalar-valued func- 
tion of class C?. (This is known as the gauge freedom 
in choosing the vector potential.) 


38. Suppose that 


GMm 
= —-—r 
lri? 

is the gravitational force field defined on R? — 

{(0, 0, 0)}. 

(a) Show that V - F = 0 by direct calculation. 

(b) Show that F 4 V x G for any C! vector field G 
defined on R? — {(0, 0, 0)} by using Stokes’s the- 
orem. (Hint: Take a sphere S enclosing the origin 
and break it up into the upper and lower hemi- 
spheres. Consider ff, F + dS as the sum of the sur- 
face integrals over the two hemispheres.) 

(c) Why do parts (a) and (b) not contradict the result 
of Exercise 33? 


In Exercises 39-44 below, you will derive a type of wave equa- 
tion and see how Maxwell's equations can be reduced to this 
wave equation. Assume that the electric and magnetic fields E 
and B are defined on a simply-connected region in RÌ; also 
let the symbol V denote differentiation with respect to x, y, z 
(i.e., not with respect to t). 


39. 


40. 


41. 


42. 


Recall that equation (20) in §7.4 was derived by show- 
ing that the magnetic field B had a vector potential 
A of class C! (see Exercise 33 above); that is, that 
B= V x A. Use Faraday’s law (equation (27)) to show 
that the vector field E + 0A/dt is conservative. Hence, 
we may write 
E+ Aa Vf 
or 
for an appropriate scalar-valued function f(x, y, z, t). 


Use the vector potential A for B in Ampére’s law (equa- 
tion (24) of §7.4) to conclude that 


2 


VA ao = J+V([V-A- af 
Hooma 7 Ho Hoo sy . 


(2) 
(Hint: See part (a) of Exercise 15 of §7.4.) 
Use Gauss’s law (equation (12) of §7.4) to show that 


ə 
Vf =L +2 (V-A). 8) 
€0 ot 

As noted in Exercise 37, the vector potential A is only 
unique up to addition of Vø, where (x, y, z, t) is a 
scalar-valued function of class C. That is, any vector 
field A = A+ V¢ also works as a vector potential for 
B. However, the function f that arises in Exercises 39— 
41 will change. 

(a) Show that the function f associated to A is related 

to f as 


> ð$ 
re 


(b) Show that the condition 


R af 
V-A= —, 
Lo€eo at 


where A = A + V@, is equivalent to the existence 
of solutions to the (inhomogeneous) wave equation 


ao a 
V'p — Moo = (v-a Ho€o a J (4) 


Given A and f it is possible to solve equation 
(4) for ġ, so we may assume that the condition 
V-A= Lo€00f /dt holds. 


43. 


44. 
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Given the condition V • A = uocoðf/ðt, show that 
equations (2) and (3) become 


aA 
VA Moco = — poj; (5) 
Pf p 
V? f — L, 6 
f — Lo€o r~a (6) 


Conversely, suppose that A and f satisfy the condition 
V-A= uocoð f /ðt and equations (5) and (6). Show 
that then E = —dA/dt+ Vf and B= V x A must 
satisfy Maxwell’s equations. Hence, solutions to (4) 
enable us to define a vector field A and a scalar field 
f from which we may construct E and B that satisfy 
Maxwell’s equations. 
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Higher Dimensions 


Introduction 


In this concluding chapter, our goal is to find a way to unify and extend the 
three main theorems of vector analysis (namely, the theorems of Green, Gauss, 
and Stokes). To accomplish such a task, we need to develop the notion of a 
differential form whose integral embraces and generalizes line, surface, and 
volume integrals. 


8.1 An Introduction to Differential Forms 


Throughout this section, U will denote an open set in R”, where R” has coor- 
dinates (x1, x2,...,Xn), as usual. Any functions that appear are assumed to be 
appropriately differentiable. 


Differential Forms 


We begin by giving a new name to an old friend. If f: U C R” — Risa scalar- 
valued function (of class C*), we will also refer to f as a differential 0-form, 
or just a 0-form for short. 0-forms can be added to one another and multiplied 
together, as well we know. 

The next step is to describe differential 1-forms. Ultimately, we will see that a 
differential 1-form is a generalization of f(x) dx—that is, of something that can 
be integrated with respect to a single variable, such as with a line integral. More 
precisely, in R”, the basic differential 1-forms are denoted dx,, dx2,..., dXp. 
A general (differential) 1-form œw is an expression that is built from the basic 
1-forms as 


æ = F\(x1,.--,Xn) dx, + Fo(xyq,..-, Xn) dx. +... + Fa(x1, -< <, Xn) dXn, 


where, for j = 1,...,, F; is a scalar-valued function (of class ConU CR". 
Differential 1-forms can be added to one another, and we can multiply a 0-form 
f and a 1-form œ (both defined on U C R”) in the obvious way: If 


w = Fi dx, + Fo dx +--+ Fy dxn, 
then 


fo = fF dx, + fFodx.+---+ fFi dy. 
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EXAMPLE 1 InR’, let 
w = xyz dx + z? cos ydy + ze* dz and n = (y —z)dx + z? sin y dy — 2 dz. 
Then 

@ +n = (xyz + y — z)dx + 2(cos y + sin y)dy + (ze* — 2) dz. 
If f(x, y, z) = xe” — z, then 

fa = (xe — z)xyzdx + (xe — z)z’ cos y dy + (xe — z)ze*dz. + 

Thus far, we have described 1-forms merely as formal expressions in certain 

symbols. But 1-forms can also be thought of as functions. The basic 1-forms 


dx,,...,dX, take as argument a vector a = (a1, d2,..., an) in R”; the value of 
dx; on ais 


dx;(a) = aj. 


In others words, dx; extracts the ith component of the vector a. 
More generally, for each xo € U, the 1-form œ gives rise to a combination 
@x, Of basic 1-forms 


Ox, = F\ (Xo) dx} ae Fa (Xo) dXn5 
x, acts on the vector a € R” as 


@x,(a) = F\(X0) dx) (a) + Fo(X0) dx2(a) + +++ + Fy(X0) dxn(a). 


EXAMPLE 2 Suppose w is the 1-form defined on R? by 
w = x yz dx + y’zdy — 3xyz dz. 
If xo = (1, —2, 5) and a = (a1, a2, a3), then 
(1,-2,5)(a) = —10 dx(a) + 20 dy(a) + 30 dz(a) 
= — 10a; + 20a2 + 30a3, 
and, if xọ = (3, 4, 6), then 
(3,4,6)(a) = 216 dx(a) + 96 dy(a) — 216 dz(a) 
= 216a; + 96a — 216a3. 


The notation suggests that a 1-form is a function of the vector a but that this 
function varies from point to point as xo changes. Indeed, 1-forms are actually 
functions on vector fields. + 


A basic (differential) 2-form on R” is an expression of the form 
di N\dxj, i,j =1,...,n. 


It is also a function that requires two vector arguments a and b, and we evaluate 
this function as 


dxj(a) — dx;(b) 


dx; ^ dx (a,b) = dx;(a) dxb) |’ 


(The determinant represents, up to sign, the area of the parallelogram spanned 
by the projections of a and b in the x;x;-plane.) It is not difficult to see that, for 
beg = Vy agen 
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dxi ^dxj = —dxj ^ ax; 


dxi \ dx; = 0. 


Formula (1) can be established by comparing dx; A dxj;(a,b) with dx; A 
dx;(a, b). Formula (2) follows from formula (1). Given formulas (1) and (2), 
we see that we can generate all the linearly independent, nontrivial basic 2-forms 
on R” by listing all possible terms dx; A dx;, where i and j are integers between 
l andn withi < j: 


dx; Adx2, dx; A dx3,...,dx1 A dXxn, 
dx ^A dx3,...,dXx. ^ dXp, 
dXn—| \ dXp. 


To count how many 2-forms are in this list, note that there are n choices for dx; 
andn — 1 choices for dx; (so that dx; # dx; in view of (2)), and a “correction” 
factor of 2 so as not to count both dx; A dx; and dx; A dx; in light of (1). Hence, 
there are n(n — 1)/2 independent 2-forms. 

Let x = (x1, X2,...,%,). A general (differential) 2-form on U C R” is an 
expression 


w = F,2(x) dx, A dx + F\3(x) dx, A dx3 + +++ + Fy-in(X) dxn-1 A dxn, 


where each Fj; is a real-valued function F;;: U C R” —> R. The idea here is to 
generalize something that can be integrated with respect to two variables—such 
as with a surface integral. 


EXAMPLE 3 In R?, a general 2-form may be written as 
Fi(x, y, z)dy Adz + P(x, y, z)dz ^ dx + F5(x, y, z)dx Ady. 

The reason for using this somewhat curious ordering of the terms in the sum will, 
we hope, become clear later in the chapter. + 

Given a point xọ € U C R”, to evaluate a general 2-form on the ordered pair 
(a, b) of vectors, we have 

wx (a, b) = Fi2(xo)dxı A dx2(a, b) + Fi3(xX0)dxı A dx3(a, b) 
a Ht Fn—in(Xo) dXn-1 A dx,(a, b). 

EXAMPLE 4 InR’, leto = 3xy dy Adz+(2y +z)dzAdx +(x —z)dx A 
dy. Then 


@(1,2,-3)(a, b) = 6dy A dz(a, b)+ dz A dx(a, b)+ 4dx ^ dy(a, b) 


= 6(a2b3 — a3bz) + (a3bı — a,b3) + 4(aıb2 — a2b1). $ 
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Finally, we generalize the notions of 1-forms and 2-forms to provide a defi- 
nition of a k-form. 


DEFINITION 1.1 Letk be a positive integer. A basic (differential) k-form 
on R” is an expression of the form 


do IN ax; INOS OTK dic 


where 1 < i; < n for j = 1,...,k. The basic k-forms are also functions that 
require k vector arguments a), a2, ..., a, and are evaluated as 


dx;,(a1) dxi (a2) Fen dxi (ax) 


dx (ai) dx (a2) --- dx; (ax) 
dx;, \-++ A dx;,(a1,..., ax) = det z 1 : . j 


| dxi (a1) dxj,(a2) +++ dx (ak) | 


EXAMPLE 5. Let 
ai = (1,2, —1,3,0), a= (5,4,3,2,1), and a = (0, 1,3, —2, 0) 


be three vectors in R5. Then we have 


1 5 0 
dx, A dx3 A dxs(a1, a2, a3) = det} —1 3J 3 | ==3; 
0 1 0 ? 


Using properties of determinants, we can show that 


dxi Ao A dxi Nii A dxi Nie A dxi 


= —dx;, Atie A dx, A+++ Ada Neee Adag 


dx;, A+++ Adxj, Aeee Adxi, A... Ndx;, = 0. 


Formula (3) says that switching two terms (namely, dx;, and dx;,) in the basic 
k-form dx;, A ++- A dx;, causes a sign change, and formula (4) says that a basic 
k-form containing two identical terms is zero. Formulas (3) and (4) generalize 
formulas (1) and (2). 


DEFINITION 1.2 A general (differential) k-form on U C R” is an expres- 
sion of the form p 
o= S E dan dx, 
e E 


where each F;,_;, is a real-valued function F; i: U —> R. Given a point 


xo € U, we evaluate w on an ordered k-tuple (a), ..., ag) of vectors as 


n 
Op Gina S Fy ota, A Ad Gn a): 


(lances =l 
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Note that a 0-form is so named because, in order to be consistent with a 
1-form or 2-form, it must take zero vector arguments! 
In view of formulas (3) and (4), we write a general k-form as 


w = > Fii d Xi IN aA, dXi,. 


1Si| <- <i <n 


(That is, the sum may be taken over strictly increasing indices i1, ..., ig.) For 
example, the 4-form 


w = xı dxı A dx3 A dx4 ^ dxs + (x3 — xz) dx A dx A dxs A^ dx3 
+ xıx3 dxs A dx3 A dx4 ^ dx, 


may be written in the “standard form” with increasing indices as 
æ = (x2 — xı x3)dxı A dx3 A dx4 A dxs + (x2 — x3)dx, A dx? A dx3 A dxs. 


Two k-forms may be added in the obvious way, and the product of a 0-form 
f and a k-form w is analogous to the product of a 0-form and a 1-form. 


Exterior Product 


The symbol A that we have been using does, in fact, denote a type of multiplication 
called the exterior (or wedge) product. The exterior product can be extended to 
general differential forms in the following manner: 


DEFINITION 1.3 LetU C R” beopen. Let f denote a0-form on U.Letw = 
Y 0%, A Ada denote a k-form on U and 7 =) G;, dx; A 
--» A dx; an /-form. Then we define 


To- o >) fia fda, 
ONn= eG een, IK.o2.8 A dXxi, A abe 2 -- A dx;j,. 


Thus, the wedge product of a k-form and an /-form is a (k + /)-form. 


EXAMPLE 6 Let 
w = x? dx, A dx + (2x3 — x2)dxı A dx3 + e® dx3 ^ dx4 
and 
n = x4dxı A dx3 A dxs + x6 dx ^A dx4 ^ dXx6 
be, respectively, a 2-form and a 3-form on R°. Then Definition 1.3 yields 


OAn= xix dx; A dx. A dx, A dx3 ^ dxs 
+ (2x3 — x2)x4 dx, A dx3 A dx, A dx3 A dxs 
+ e x4 dx3 A dx4 A dx, A dx3 ^ dxs 
+ x? x6 dx; A dx A dx ^A dx4 A dX6 


+ (2x3 — x2)x6 dx, A dx3 A dx2 ^ dx4 ^ dx6 


+ exe dx; A dx4 A dx ^ dx4 ^ dXx¢. 


8.1 


Determine the values of the following differential forms on the 
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Because of formula (4), most of the terms in this sum are zero. In fact, 


w A Nn = (2x3 — X2)x6 dx, A dx3 A dx2 A dx4 A dxX6 


using formula (3). 


= (x2 — 2x3)x6 dx, A dx2 A dx3 A dx4 A dxe, 


+ 


From the various definitions and observations made so far, we can estab- 
lish the following results, which are useful when computing with differential 


forms: 


PROPOSITION 1.4 (PROPERTIES OF THE EXTERIOR PRODUCT) Assume that all 
the differential forms that follow are defined on U C R”: 


1. Distributivity. If @, and œz are k-forms and ņ is an /-form, then 


(@i +ar)An=a1An+ a2 An. 


2. Anticommutativity. If @ is a k-form and 7 an /-form, then 


onn=(-lnaro. 


3. Associativity. If œ is a k-form, 7 an /-form, and t a p-form, then 


(WAN)AT=H@A(NAT). 


4. Homogeneity. If @ is a k-form, 7 an /-form, and f a 0-form, then 


Exercises 


ordered sets of vectors indicated in Exercises 1—7. 


1 


a A W N 


. dx; — 3dx2; a = (7, 3) 

. 2dx + 6dy —5dz;a= (1, —1, —2) 

- 3dx; A dx2;a = (4, —1), b = (2, 0) 

. 4dx Ady —7dy A dz; a = (0, 1, —1), b = (1, 3, 2) 

. Tdx Ady Adz; a= (1,0,3), b=(2,—1,0), e= 
(5, 2, 1) 

. dx; A dx, +2dx2 Adx3+3dx3Adx4; a= (1,2, 
3,4), b = (4, 3, 2, 1) 

. 2dx, A dx3 A dx4 + dx A dx3 A dxs; a= (1,0, 


—1, 4,2), b = (0, 0, 9, 1, —1), e = (5, 0, 0, 0, —2) 
. Let w be the 1-form on R? defined by 
w=x'ydx+yzdy+ 2x dz. 
Find @3,—1,4)(a), where a = (a1, a2, a3). 
. Let w be the 2-form on Ri given by 
@ = xx; dx, A dx3 — X2x4dx2. A dXx4. 


Find @,—1,—3,1)(a, b). 


10. 


11. 


12. 


13. 


15. 


(fo)An= f@An=oA(fn). 


Let w be the 2-form on R? given by 

æ = cosxdxA dy — sin z dy^ dz + (y? + 3)dxA dz. 
Find @@,-1,7/2)(a, b), where a= (a, a2, a3) and 
b = (by, bo, b3). 


Let w be as in Exercise 10. Find a ,5,-)((2, 0, —1), 
(1,7, 5)). 


Let w be the 3-form on R? given by 
w = (e* cos y + Gy + 2)e**) dx A dy ^ dz. 


Find @(,0,0)(a,b,¢), where a= (a1, a2, a3), b= 
(bı, bo, b3), ande = (ci; C35 C3). 


Let w be as in Exercise 12. Find a,,y,-)((1, 9, 0), 
(0, 2, 0), (0, 0, 3)). 


In Exercises 14-19, determine w ^ n. 


14. 


On R: w=3dx+2dy—xdz; 
cos y dy + 7 dz. 


n = x°dx — 


On RÌ: w= ydx — x dy; n=zdx ^dy+ydx ^ 
dz + x dy ^ dz. 
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16 


17. 


18. 


19. 


20. 


Chapter 8 | Vector Analysis in Higher Dimensions 


OnR*:@=2dx, A dx — x3 dx2 A dxa; n =2x4 dx; A 
dx3 + (x3 — x2)dx3 A dx4. 


On Rê: w =X, dx, + 2x2 dx. + 3x3 dx3; n =(xı + 
x2)dxı A dx A dx3 + (x3 — x4)dxı A dx2 A dx4. 


On Rô: w= (xı + x2)dxı A dxz A dx3 + (x3 — x4) 
dx, A dx A dx4; n= x; dxı + 2x2 dx2 + 3x3 dx3. 


On R: @ = xı dx2 A dx3 — x2X3 dxı ^ dxs; 
n = eSdx, A dx4 A dx5 — xı cos x5 dx A dx3 ^ 
dx4. 


Prove formula (3) by evaluating dx;, A dxi, \--- A 


dxi, on k vectors a;,..., a, in R”. 


21. 
22. 


23. 
24. 


25. 
26. 


Prove formula (4). (Hint: Use formula (3).) 


Explain why a k-form on R” with k > n must be iden- 
tically zero. 


Prove property | of Proposition 1.4. 


Prove property 2 of Proposition 1.4. (Hint: Use for- 
mula (3).) 


Prove property 3 of Proposition 1.4. 


Prove property 4 of Proposition 1.4. 


8.2 Manifolds and Integrals of k-forms 


In this section, we investigate how to integrate k-forms over k-dimensional objects 
(i.e., curves, surfaces, and higher-dimensional analogues) in R”. 


Integrals over Curves and Surfaces 


We begin by considering integrals of 1-forms and 2-forms over parametrized 


curves and surfaces. 


DEFINITION 2.1 


Let x: [a, b] > R” be a C! path in R”. If w is a 1-form 
defined on an open set U C R” that contains the image of x, then the integral 
of w over x, denoted f œw, is 


l Q= | i oxa H)dt. 


EXAMPLE 1 Let w = (x? + y)dx + yz dy + (x + y — z) dz. We integrate w 
over the path x: [0, 1] > R3, x(t) = (2t + 3, 3t,7 — t). 
We have x’(t) = (2, 3, —1) so that, using Definition 2.1, we find that 


1 
fe >] 21+3,31,7-1(2, 3, —1) dt 
x 0 


1 
=] [((2t +3)? + 3t) dx(2, 3, —1) + 3t(7 — t)dy(2, 3, —1) 
0 


+ (2t+3+ 3t —(7 —1))dz(2, 3, —l)] dt 


=| lar 


15t 


=f t? 


In general, if 


w = F; dx, + Fy dx + 


87t 


9). 2 + (21t — 3t) -3 + (6t — 4): (—1)]dt 


— 391 
22)dt = 221. 


e-e + Fy dx, 
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is a 1-form on R” and x: [a, b] — R” is any path, then 
oN E) = FAE dA E) + PEA) dx A) 
+ + FrEE dan A) 
= FAAO + PREO H + Fn(x())x, 0) 
= (FEO), PREO) ---, Fn(x()) xO. 


From this we conclude the following: 


PROPOSITION 2.2 If F denotes the vector field (F1, F2, ..., Fn) and 
œ = Fı dx + Fo dx +- -- Px 
and if x: [a, b] > R” is a C! path, then 


fe = [ oxn (1)) dt = i F(x(t))+ x(t) dt = [Fas 


That is, integrating a 1-form over a path (or, indeed, over a simple, piecewise C! 
curve) is exactly the same as computing a vector line integral. 


Now we see how to integrate 2-forms over parametrized surfaces in R°. 


DEFINITION 2.3 Let D be a bounded, connected region in R? and let 
X: D — R? bea smooth parametrized surface in R°. If w is a 2-form defined 
on an open set in R? that contains X(D), then we define is æ, the integral 


of w over X, as 
if w= i wxg, n(Ts, T;) ds dt. 
x D 


(Recall that T, = 0X/ds and T; = 0X/dt.) 


Let’s work out the integral in Definition 2.3. We write w as 
F, dy Adz + Fy dz A dx + F3dx A dy 
and X(s, t) as (x(s, t), y(s, t), z(s, t)). Therefore, 


fo=ff @x(s,)(Ts, T;) ds dt 
X D 


= Tf [Fi(X(s, t))dy A dz(T;, T;) + Fo(X(s, t))dz A dx(Ts, T;) 
D 
+ F3(X(s, t))dx A dy(T,, T,)] ds dt. 


By definition of the basic 2-forms, 


dy(Ts) — dy(T;) 
dz(T;)  dz(T;) 


dy A dz(T,, T;) = det 


dy/ds  dy/dt | A(y,z) 
dz/as  dz/at a(s, t) 


= det 
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Similarly, we have 


0 Py ð ’ 

OD) at dena TS 
a(s, t) a(s, t) 
Hence, if F = (F, Fo, F3), then 


_ 3Y, z) a(z, x) 
[ be i [ [ræ O 


d(x, y) 
a(s, t) 


_ d(y,Z) OZ, x) A(x, y) 
g J I, POMS, 1) (oe 1)’ 36, H 86.0) ) a 


dz A dx(Ts, T;) = 


+ F3(X(s, t)) 


Jasar 


Recall from formula (7) in §7.1 that 
(Se z) A(z, x) A(x, y) 
a(s, t)” O(s,t)’ As, t) 


the normal to X(D) at the point X(s, t). Therefore, we have established the fol- 
lowing Proposition (see also Definition 2.2 of Chapter 7): 


) = N(s, f), 


PROPOSITION 2.4 If F denotes the vector field F = F; i + F)j + F; K and 
w = Fı dy Adz + Fdz A dx + F; dx Ady 


and if X: D + R? is a smooth parametrized surface such that w (or F) is defined 
on an open set containing X(D), then 


[ o= f Í oxa Tas at = | [ F(X(s, 9) NOs, ds dt 
= | f F-as. 


Parametrized Manifolds 


Next, we generalize the notions of parametrized curves and surfaces to 
higher-dimensional objects in R”. To set notation, let R% have coordinates 


(uy, u2, EEAS Ux). 


DEFINITION 2.5 Let D bearegion in R¥ that consists of an open, connected 
set, possibly together with some or all of its boundary points. A parametrized 
k-manifold in R” is a continuous map X: D — R” that is one-one except, 
possibly, along 0D. We refer to the image M = X(D) as the underlying 
manifold of X (or the manifold parametrized by X). 

Such a k-manifold possesses k coordinate curves defined from X by 
holding all the variables u1, ..., ug fixed except one; namely, the jth coor- 
dinate curve is the curve parametrized by 


Hi => N Gilg 0509 C-i Un Ojai 00 9 Oh) 
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where the a;’s (i # /) are fixed constants. If X is differentiable and x1, x2,..., 
Xn denote the component functions of X, then the tangent vector to the jth 
coordinate curve, denoted T,,,, is 


ax Ox, OX =) 
= Ou; Ou; i 


A parametrized k-manifold is said to be smooth at a point X(uọ) if the 
mapping X is of class C! in a neighborhood of ug and if the k tangent vectors 
T,,,---, Tu, are linearly independent at X(uo). (Recall that k vectors vı, ..., V 
in R” are linearly independent if the equation c1vı + --- + ckv = 0 holds ifand 
only ifc = cp = +++ = cy = 0.) A parametrized k-manifold is said to be smooth 
if it is smooth at every point of X(uọ) with up in the interior of D. 

Sometimes we will refer to the underlying manifold M = X(D) of a para- 
metrized manifold X: D —> R” as a parametrized manifold; we do not expect any 
confusion will result from this abuse of terminology. 


EXAMPLE 2 Let D = (0, 1] x [1, 2] x [—1, 1] and X: D > R? be given by 
X(u1, U2, u3) = (u1 + u2, 3U2, U2U3, U2 — u3, 5u3). 


We show that M = X(D) is a smooth parametrized 3-manifold in R. 
Note first that X is continuous (in fact, of class C°°) since its component 
functions are polynomials. To see that X is one-one, consider the equation 


X(u) = X(u); (1) 
we show that u = ù. Equation (1) is equivalent to a system of five equations: 
uy + uy = Hy + ig 
3u2 = 32 
uu? = doit; 
u — U3 = Uo — U3 
Suz = 53 


The second equation implies u2 = i, and the last equation implies u3 = 3. 
Hence, the first equation becomes 


ui +u = ùi +u S&S w=. 
Thus, 
u = (Uy, u2, u3) = (ŭ1, A, ŭ3) = Ù. 


To check the smoothness of M, note that the tangent vectors to the three 
coordinate curves are 


ox 

Ty, a (1; 0, 0, 0, 0); 
ðu 
ox 

1-3, = (1,3, u3, 1, 0); 
ox 


Ty, =. —. = (0, 0, 2u.u3, —1, 5). 
3 
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(x1, y1) 


Figure 8.1 The planar robot arm 
of Example 3. Each rod is free to 
pivot about the appropriate linkage 
points. 


Therefore, to have cT + coT2 + c3T3 = 0, we must have 
(c1 + c2, 3c, u3c2 + 2u2U3C3, C2 — C3, 53) = (0, 0, 0, 0, 0). 


It readily follows that cı = c2 = c3 = 0 is the only possibility for a solution. 
Hence, T,,,, T,,,, Tu, are linearly independent at all u € D and so M is smooth at 
all points. + 


Parametrized k-manifolds, although seemingly abstract mathematical notions 
when k is larger than 3, are actually very useful for describing a variety of situa- 
tions, one of which is illustrated in the next example. 


EXAMPLE 3 A planar robot arm is constructed consisting of three linked rods 
of lengths 1, 2, and 3. (See Figure 8.1.) The rod of length 3 is anchored at the 
origin of R? but free to rotate about the origin. The rod of length 2 is attached to 
the free end of the rod of length 3, and the rod of length 1 is, in turn, attached to 
the free end of the rod of length 2. We describe the set of positions that the arm 
can take as a parametrized manifold. 

Clearly, each state of the robot arm is determined by the coordinates (x1, y1), 
(x2, y2), and (x3, y3) of the linkage points, which we may consider to form a vector 
X = (x1, Y1, X2, Y2, X3, y3) in RÍ. However, not all vectors in R° represent a state 
of the robot arm. In particular, the point (x1, y1) must lie on the circle of radius 3, 
centered at the origin, the point (x2, y2) must lie on the circle of radius 2, centered 
at (x1, y1), and the point (x3, y3) must lie on the circle of radius 1, centered at 
(x2, y2). Thus, for x = (x1, Y1, X2, Y2, X3, y3) to represent a state of the robot arm, 
we require 


X? + yr =9 
(2 —-myY+O02-yn) =4. (2) 
(x3 — 2 +03 —- yy =1 


We may parametrize each of the circles in the system (2) in a one-one fashion by 
using three different angles 6), 02, and 63. Hence, we find 


(x1, y1) = (3 cos 64, 3 sin 01), 
(x2, y2) = (xı + 2 cos 62, yı + 2 sin 62) 
= (3cos 0; + 2 cos 62, 3 sin 01 + 2 sin 62), (3) 
and 
(x3, y3) = (x2 + cos 63, y2 + sin 03) 
= (3 cos 0; + 2 cos 4) + cos 63, 3 sin 01 + 2 sin 02 + sin 63), 


where 0 < 01, 02, 03 < 27 . Therefore, the map X: [0, 27) x [0, 277) x [0, 2m) > 
R° given by 


X(01, 02, 03) = (x1, Y1, X2, Y2, X3, Y3), 


where (x1, Y1, X2, Y2, X3, y3) are given in terms of 01, 02, and 63 by means of the 
equations in (3), exhibits the set of states of the robot arm as a parametrized 3- 
manifold in RÉ. We leave it to you to check that X defines a smooth parametrized 
3-manifold. + 
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Just like a parametrized surface, a parametrized k-manifold M = X(D) may 
or may not have a boundary, denoted, as usual, by 0M. If M has a nonempty 
boundary, then 0M is contained in the image under X of the portion of the 
boundary of the domain region D that is also part of D. Under suitable (and 
mild) hypotheses, 0M, if nonempty, is, in turn, a union of finitely many (k — 1)- 
manifolds (without boundaries). 


EXAMPLE 4 Let B C R? denote the closed unit ball {u = (u1, u2, u3) | u? + 
us + u < 1}, and define X: B > R by 


2,.2,.2 
X(u1, U2, U3) = (U1, U2, U3, Uy + U5 + u3). 


Then M = X(B) is a portion of a “generalized paraboloid” having equation w = 
x? +y? +z; we have M = {(x, y, z, w) € Rô | w = x? + y? + 2, x? +y? 4ṣ 
z? < 1}. In this case, 9M = {(x, y, z, 1) | x? + y? +z? = 1}. Note that 2M is a 
parametrized 2-manifold in R4, as we may see via the map 


Y:[0, z] x [0, 2m) > Rt, Y(s,t) = (sins cost, sins sint, coss, 1). + 


Integrals over Parametrized k-manifolds 


Now, we see how to define the integral of a k-form over a smooth parametrized 
k-manifold. Our definition generalizes those of Definitions 2.1 and 2.3. 


DEFINITION 2.6 Let D bea bounded, connected region in R* and X: D > 
R” a smooth parametrized k-manifold. If w is a k-form defined on an open 
set in R” that contains M = X(D), then we define the integral of w over M 
(denoted fy w) by 


fo=f--f Oxcu) Uran aeo a Mi OU aie 
X D 


(Here f --- f refers to the k-dimensional integral over D.) 


EXAMPLE 5 Let X: [0, 1] x [1,2] x [-1, 1] —> R be the parametrized 3- 
manifold defined by 


X(u1, u2, u3) = (u1 + u2, 3u2, U2U7, Uy — u3, 5u3). 
(See Example 2.) Let w be the 3-form defined on R° as 
w = X1xX3 dx, A dx3 A dxs + (x3x4 — 2x2x5)dx2 A dx4 A dxs. 


We calculate fy œ. 
Recall from Example 2 that the tangent vectors to the three coordinate curves 
are 


Ti, = d, 0, 0, 0, 0), 
Ta = (1, 3, u3, 1, 0), 


and 
Tu, = (0, 0, 2u2U3, —l, 5): 
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Then, from Definition 2.6, 


fo 


2 spi 
= J | l f(u + uz)uzu? dx; ^A dx3 ^A dx5(Ty,, Ty,,Tu3) 
=i JO 


+ (ugu3(uz — u3) — 30u2u3) dx A dx4 A dxs(Tu,, Tuy, Tus)} dui duz dus 


Iaa a 1 1 0 
= J / | (ui + uz)uzu? 0 uy 2u2U3 
-1J1 Jo 0 0 5 
0 3 0 
} (u5u3 uu? 30u2u3)| 0 1 —1 du, duz du; 
0 0 5 
I 2 yl 
= / Í S(uy + uz)uzu? du, du duz = 3, + 
=įiJi J 


EXAMPLE 6 Ifw isa 3-form on R’, then w may be written as 
w = F(x, y,z)dx Ady ^ dz. 


(Why?) If D* is a bounded region in R? and X: D* —> R? is a smooth parametrized 
3-manifold, then Definition 2.6 tells us that 


I JII OX(u uzu) Tu Tu» Tu) dui duz du3 
X D* 


=i] F(X(u)) dx ^ dy ^ dz(Ty,, Ty,, Tu,) dui duz dus 
D* 


dx /du, 0x /du dx /0U3 


=i] F(X(u))| dy/du; əƏy/ðu2 dy/duz | du; duz du; 
D* 
0z/du, 0z/du2 0z/0u3 


=f J l reos A E d 


O(Uy, U2, U3) 


=a) F(x, y, z) dx dy dz, 
D 


from the change of variables theorem for triple integrals (Theorem 5.5 of Chapter 
5), where D = X(D*). + 


Orientation of a Parametrized k-manifold - 


We have seen that vector line integrals and vector surface integrals may be defined, 
respectively, over oriented curves and surfaces in a manner effectively independent 
of the parametrization used. We now see how it is possible to define the integral 


T 
C 


Figure 8.2 An orientation of the 
curve C shown is a choice of 
continuously varying unit tangent 
vector T along C. 


N N 


Figure 8.3 An orientation of 
the surface S is a choice of 
continuously varying unit normal 
vector N along S. 
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of a k-form over a parametrized k-manifold X: D — R” so that it depends largely 
on the underlying manifold M = X(D), rather than on the particular map X. To 
do this, we must consider how reparametrization of M affects the integral, and 
we must define what we mean by an orientation of M. 

First, we consider the notion of orientation. We have previously seen how 
parametrized curves and surfaces can be oriented by using some fairly natural 
geometric ideas. A smooth parametrized curve implicitly received an orientation 
from the parameter; typically, we orient a curve by indicating the direction in 
which the parameter variable increases. We may also think of an orientation of 
a curve as a choice of a unit tangent vector T at each point of the curve, made 
so that T varies continuously as we move along the curve. (See Figure 8.2.) An 
orientation of a smooth parametrized surface in R*, when it exists, is a choice 
of a continuously varying unit normal vector N at each point of the surface. (See 
Figure 8.3.) 

To define notions of orientation and orientability for a parametrized k- 
manifold when k > 2, we will need to work more formally. 

First, we need to introduce two related ideas from the linear algebra of R”. 
Thus, suppose vj, V2,..., Vg are vectors in R”. By a linear combination of 
Vi, -.-, Vx, we mean any vector v € R” that can be written as 


V = C1V1 + C2V2 +++ + CK 


for suitable choices of the scalars c),..., cx. The set of all possible linear com- 
binations of vı, ..., Vx, called the (linear) span of v,,..., vg, will be denoted 
Span{v,,..., Vg}. That is, 


Span{vj,..., Ve} = {c1v) +--+: + xv | c1,---, Ck E R} 


DEFINITION 2.7 Let M = X(D), where X: D C Rt -> R”, be a smooth 
parametrized k-manifold. An orientation of M is a choice of a smooth, 
nonzero k-form Q defined on M. If such a k-form Q exists, M is said to be 
orientable and oriented once a choice of such a k-form is made. 


Although we cannot readily visualize an orientation Q of a parametrized k- 
manifold when k is large, we can nonetheless see how the tangent vectors to the 
coordinate curves relate to it. 


DEFINITION 2.8 Let M = X(D) be a smooth parametrized k-manifold 
oriented by the k-form Q. The tangent vectors T,,,, ..., Tu, to the coordinate 
curves of M are said to be compatible with Q if 


Qx) (Tu 5 8) E) = 0. 


We also say that the parametrization X is compatible with the orientation Q 
if the corresponding tangent vectors T,,,..., T,, are. 


Note that if T,,,..., Ta, are incompatible with the orientation Q, then they 
are compatible with the opposite orientation —Q2. Alternatively, we may change 
the parametrization X of M by reordering the variables u1, ..., ug to, Say, U2, U1, 
U3,..., Uk, SO that T,,, Ty,, Ty;,..., Ty, are compatible with Q. 
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Definition 2.7 is consistent with the earlier definitions of orientations of 
curves and surfaces, as we now discuss. Suppose first that x: Z —> R” is a smooth 
parametrized curve in R” (where / is an interval in R) and T is a continuously 
varying choice of unit tangent vector along C = x(/). Then we may define an 
orientation 1-form Q on C by 


Qy(n(a) = T.a. 


Conversely, given an orientation 1-form Q, we may define a continuously varying 
unit tangent vector from it by taking T to be the unique unit vector parallel to 
x’(t) such that, for any nonzero vector a parallel to x‘(t), 


T-a has the same sign as Qx) (a). 


That T is uniquely determined follows because T must equal +x’(t)/||x’(t)||, so 
knowing a and the value of Q2x)(a) determines the choice of sign for T. 

Similarly, suppose 5 = X(D) is a smooth parametrized surface in R? (i.e., a 
smooth parametrized 2-manifold). If we can orient S by a continuously varying 
unit normal N, then we may define an orientation 2-form Q on S by 


Qx(u,,u)(A, b) = det [ N ab i) 


where [ Nab ] is the 3 x 3 matrix whose columns are, in order, the vectors 
N, a, b. Conversely, given an orientation 2-form Q on S, we may define a con- 
tinuously varying unit normal N from it by taking N to be the unique unit vector 
perpendicular to T,,, and T,,, (and hence to every vector in Span{T,,, T,,,}) such 
that, for any pair a, b of linearly independent vectors in Span{T,,, T,,,}, 


det[ N a b ] has the same sign as yq, ,u)(a, b). 
To see that N is uniquely determined, note that, given linearly independent vectors 
a, b in Span{T,,, T,,,}, the only possibilities for N are 
Ty, x Ti, 
Tu, x Tall 


Hence, we choose the sign for the normal vector N so that det [ N ab ] has 
the same sign as Q2x(u,,1,)(a, b). 


EXAMPLE 7 Consider the generalized paraboloid M = {(x, y, z, w) € Rf | 
w = x? + y? + 27}, which we may exhibit as a smooth parametrized 3-manifold 
via 


X:R? > R, X(u,, u2, u3) = (u1, U2, U3, u? + uz + U3). 


We show how to orient M. 

Note that the equation x? + y? + z? — w = 0 shows that M is the level set at 
height 0 of the function F(x, y, z, w) = pe As y? +z? — w. Hence, the gradient 
VF = (2x, 2y, 2z, —1) is a vector normal to M. If we employ the parametrization 
X and normalize the (parametrized) gradient, we see that 


(2u, 2u2, 2u3, —1) 
(403 + 4u? + 4u3 +1 


N(u1, u2, u3) = 


is a continuously varying unit normal. Moreover, the 3-form Q defined on M as 


xw (a1, a2, a3) = det [N a; a2 a3 | 
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gives an orientation for M. Note that 


Qx~a(Tr, > Ti; T) = det [N Tu, Ti, Ta] 


ai 1 0 0 
Jah 4u 4u5 1 
w 0 1 0 
Ja Aur, 4u3 1 
= det 
a 0 0 1 
aur 4u3 Au 1 
= 2u; 2u 2 
uj uz u3 
(43 4u3 Au 1 
= Jin? Aus, Aus 1. 


Since this last expression is strictly positive, we see that T,,, T,,,, T,, are com- 
patible with Q. + 


EXAMPLE 8 We may generalize Example 7 as follows: 

Suppose that M C R” is the graph of a function f: U C R’"! > R”; that 
is, suppose M is defined by the equation x, = f(x,,...,X,—1). Then M may be 
parametrized as an (n — 1)-manifold via 


X: U = R”! => R”, X(u, EAT Un—1) = (u, e.. Un=ls f(u, eeey Un—1))- 
Since M is also the level set at height 0 of the function 
F(X,.+.,%) = I isse sy tnai) aes 


a vector normal to M is provided by the gradient VF = (f,,,..-. fernis — 1). If we 
normalize V F and use the parametrization X, we see that we have a continuously 
varying unit normal 


(fur erie) Juis =!) 


N(ui,..., Un 1) = 2 
er 


from which we may define our orientation (n — 1)-form Q for M by 


Qx(@1,.--, 1) =det[N a; «++ ayy]. š 


Now suppose that M is a smooth parametrized k-manifold in R” with non- 
empty boundary 0M. If M is oriented by the k-form Q, then there is a way to 
derive from it an orientation for 0M, which we describe in Definition 2.9. To 
set notation, let X: D C Rt —> R” denote the parametrization of M and suppose 
Y: E CR‘! — R” gives a parametrization of a connected piece of 3M as a 
smooth (k — 1)-manifold. Since 0M is part of M, ifs = (s),..., Sk-1) € E, then 
there is some u = (u1, ..., uk) € D such that Y(s) = X(u). 
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Figure 8.4 The outward-pointing 
unit vector V of Definition 2.9. 


DEFINITION 2.9 Let M be a smooth parametrized k-manifold in R” with 
boundary 0M. Suppose M is oriented by the k-form Q. Then the connected 
pieces of ðM are said to be oriented consistently with M, or that 0M has 
its orientation induced from that of M, if the orientation (k — 1)-form Q?” 
is determined from Q as follows. Let V be the unique, outward-pointing unit 
vector in R”, defined and varying continuously along 0M, that is tangent to 
M and normal to 0M. (See Figure 8.4.) Then 2 is defined as 


Tio Ek see o i) = roo Billo oc on Bea) 


where the map X: D C R% —> R” parametrizes M, the map Y: E C RE! > 
R” parametrizes a connected piece of 0M, and Y(s) = X (u). 


Note that, in particular, the vector V in Definition 2.9 must be such that 


* V € Span{T,,, ..., Ta, } Ge., V is tangent to M); 
eVo —0 tori a eV 1s normalitoro W); 
° V points away from M. 


These conditions are often not difficult to achieve in practice. Definition 2.9 will 
be very important when we consider a generalization of Stokes’s theorem in the 
next section. 


EXAMPLE 9 Consider the surface S in R? consisting of the portion of the 
cylinder x? + y? = 4 with 2 < z < 5. Note that the boundary of S consists of the 
two circles {(x, y, z) | x? + y? = 4, z = 2}and {(x, y, z) | x? + y? = 4, z = 5}. 
We investigate how to orient 0S consistently with an orientation of S. 

The cylinder may be parametrized as a 2-manifold in R? by 


X: [0, 277) x [2,5] > R’, X(u1, u2) = (2 cos u1, 2 sin u1, u2). 
Then the tangent vectors to the coordinate curves are 
T,, = (—2 sin u4, 2 cos u1, 0) 
and 
Ta = (0, 0, 1). 


Since S is a portion of the level set at height 4 of the function F(x, y, z) = x? + y’, 
a unit normal N to S is given by 


VE _ (2x, 2y, 0) 2 ( y 0) 
IVF] yat \2°2’ 
In terms of the parametrization X, the normal N is also given by 
N = (cos uy, sin u1, 0). 
Then we may define an orientation 2-form on S by 


Q X(u, u) (A1, a2) = det [N al a]. 


Figure 8.5 Orienting the 
boundary of the surface S of 
Example 9. Note the 
outward-pointing tangent vectors 
Viop and Vbottom- 
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Hence, 
cosu; —2sinu, 0 
Qx (0) (Tu f T) = det sin uy 2 cos Ui 0 =2 s0. 
0 0 1 


Thus, T,,, Ta, are compatible with Q. 
We may parametrize 0S by using two mappings: 


Bottom circle: | Y,:[0,27) > R3, Yj (s) = (2coss, 2 sins, 2) 
and 
Top circle: Y»:[0,27)— R, Y2(s) = (2coss, 2sins, 5) 


To use Definition 2.9 to orient 3 S, we must identify outward-pointing vectors tan- 
gent to S and normal to 0S. From Figure 8.5, we see that along the top circle V = 
Viop = (0, 0, 1) works, while along the bottom circle, V = Vobottom = (0, 0, —1) 
suffices. Hence, Definition 2.9 tells us that, along the bottom circle, 


QY (a) = Q2x(5,2)(Vbottom, a) = det [ N Voottom a ] ’ 
while along the top circle, 
QY (A) = Qy(s,5)(Viop, a) = det[ N Viop a]. 


For both maps Y; and Y2, we have that the coordinate tangent vector is T, = 
(—2 sins, 2coss, 0). Thus, along the bottom circle, 


coss 0 —2sins 
OF (Ts) = det | sins 0 2coss |=2, 
0 -1 0 


so T, is compatible with the orientation 1-form °°. However, along the top 
circle, 
coss 0 —2sins 
Wao (Ts) =det| sins 0 2coss | = -—2, 
0 1 0 


so T, is incompatible with Q°5. Therefore, we must orient the top circle clockwise 
around the z-axis and the bottom circle counterclockwise. + 


The following example is the three-dimensional analogue of Example 9: 


EXAMPLE 10 Considerthe subset M C Rf givenby M = {(x, y, z, w) | x? + 
y? +z? = 4, 2 < w < 5}. This set M is a portion of the cylinder over a sphere 
of radius 2. Note that the boundary of M consists of the two spheres Spottom = 
{(x, y, z, 2) | x? + y? +z? = 4} and Stop = {(x, y, z, 5) | x? + y? +z? = 4}. We 
investigate M and ð M as parametrized manifolds, orient M, and study the induced 
orientation on 0M. 

First, we note that M may be parametrized as a 3-manifold in R* by 


X: [0, x] x [0, 27) x [2,5] > Rê, 
X(u1, U2, U3) = (2 sin u; cos u2, 2 sin u; sin u2, 2 COS u1, u3). 


(This is the usual parametrization of a sphere using spherical coordinates g = u1, 
0 = uz, with an additional parameter u3 for the “vertical” w-axis.) The tangent 
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vectors to the coordinate curves are given by 
T„, = (2 cos u1 cos u2, 2 cos u; sin u2, —2 sin u1, 0), 


T,, = (—2 sin u; sin uz, 2 sin u; cos u2, 0, 0), 


and 
Ta, = (0, 0, 0, 1). 


Note that this parametrization fails to be smooth when uw, is 0 or x, since then 
T,,, = 0 at those values for u;. You can check that the parametrization is smooth 
at all other values of X(u) (1.e., for u in (0, x) x [0, 277) x [2, 5]). 

Because M is a portion of the level set at height 4 of the function F(x, y, 
z, w) = x? + y? + z*, a unit normal N to M is given by 


VF (2x, 2y, 2z, 0) -(ž y z 0) 
IVF Il /4x?2 + 4y2 + 47? 222 a 


In terms of the parametrization X, the normal N is also given by 
N = (sin u; COS u2, sin u; sin u2, cos Uy, 0). 
We define an orientation 3-form Q for M by 
Qxw)(aı, a2, a3) = det[N a; a as]. 
Then 


sin u COSU? 2COSU; COSU? —2sinu,sinu, 0 


sinu sinuz 2cosu,sinu, 2sinu;cosu, OV 
Qxauy(Ty, > Ti; T) = det 


COS Uy —2 sin uy, 0 0 
0 0 0 1 
= 4sinu, > 0 


for 0 < uı < x (which is where the parametrization X is smooth). Hence, T,,,, 
Ta, Ta, are compatible with Q. 
We parametrize the two pieces of 0M with two mappings: 


“Bottom” sphere Spottom: 
Y,:[0, z] x [0, 27) > Rê, 
Y\ (51, 52) = (2 sin sı cos s2, 2 sin sı sin s2, 2 cos s1, 2), 
and 
“Top” sphere Stop: 
Y2: [0, 2] x [0, 27) > R4, 
Y2(51, 52) = (2 sin sı cos $2, 2 sin sı sin s2, 2 cos s1, 5). 
Note that both parametrizations Yı and Y> give the same tangent vectors to the 
corresponding coordinate curves, namely, 
T;, = (2 cos sı cos s2, 2 cos sı sin s2, —2 sin sı, 0) 
and 
T,, = (—2 sin sı sin s2, 2 sin sı cos s2, 0, 0), 


and, by considering these tangent vectors, we see that the parametrizations are 
smooth whenever sı 4 0, 7. 
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To give 0M the orientation induced from that of M, we identify outward- 
pointing unit vectors tangent to M and normal to 0M. Thus, we need V such that 


° Ve Span{T,,, Tan, Toa} <> V-N=0; 
* V-T, =V-T,, = 0; 
e V points away from M. 


It’s not difficult to see that we must take V = Viop = (0, 0, 0, 1) along Stop and 
V = Voottom = (0, 0,0, —1) along Stottom. Therefore, Definition 2.9 tells us that 
along Spbottom, 


OM (At ; a2) = Q2x(5,2)(Vobottom: al, a2). 
In particular, 
OM (Ta. Ta) = det[N Voom Ts Ta] 


sin sı coss2 0 2coss;,coss. —2 sin sı sin s2 
sin sı sins. 0  2cossısins2 2 sin sı COS s2 


= det 
Me cos S| 0 —2 sin sı 0 
0 —1 0 0 
= 4sinsı > 0 


for 0 < sı < x (i.e., where the parametrization Y; is smooth). Thus, Y; is com- 
patible with Q°”. Along Stop, however, we have 


OF ln , T;,) = det [N Voottom Ts, T, ] 


sins; coss$2 0 2cossıcoss2 —2 sin sı sin s2 
sin sı Sins. 0 2coss; sins, 2 sinsi cos s2 
= det i 
COS s1 0 —?2 sin sı 0 


0 1 0 0 
= —4sinsı < 0 


for 0 < sı < x, so Y2 is incompatible with Q°” We must take care with this 
distinction when we consider the general version of Stokes’s theorem. + 


Next, we examine how the integral of a k-form œw can vary when taken 
over two different parametrizations X: Dı —> R” and Y: D} —> R” for the same 
k-manifold M = X(D,) = Y(D2). 


DEFINITION 2.10 Let X: Dı C Rt > R” and Y: D, CR* > R” be 
parametrized k-manifolds. We say that Y is a reparametrization of X if there 
is a one-one and onto function H: D —> D; with inverse H~! D; —> D, such 
that Y(s) = X(H(s)), that is, such that Y = X o H. If X and Y are smooth and 
H and H`! are both class C1, then we say that Y is a smooth reparametrization 
of X. 


Since H is one-one, it can be shown that the Jacobian det DH cannot change 
sign from positive to negative (or vice versa). Thus, we say that both H and Y are 
orientation-preserving ifthe Jacobian det DH is positive, orientation-reversing 
if det DH is negative. 
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The following result is a generalization of Theorem 2.5 of Chapter 7 to the 
case of k-manifolds. 


THEOREM 2.11 Let X: Dı C R% — R” be a smooth parametrized k-manifold 
and œw a k-form defined on X(D,). If Y: Dy C Rt — R” is any smooth 
reparametrization of X, then either 


fo fo 


if Y is orientation-preserving, or 


fo=-[e 


if Y is orientation-reversing. 


In view of Theorem 2.11, we can define what we mean by f m 2» Where M 
is a subset of R” that can be parametrized as an oriented k-manifold and w is a 
k-form defined on M. We simply let 


f-k 


where X: D C R? —> R’ is any smooth parametrization of M that is compatible 
with the orientation chosen. 


EXAMPLE 11 We evaluate f c %, Where C is the (oriented) line segment in R? 
from (0, —1, —2) to (1, 2, 3) and œw = z dx + x dy + y dz. 

Using Theorem 2.11, we may parametrize C in any way that preserves the 
orientation. Thus, 


x: [0,1] > R3, x)= (1 — t)(0, —1, —2) + t(1, 2, 3) = (t, 3t — 1, 5t — 2) 


is one way to make such a parametrization. Then x'(t) = (1, 3, 5) and, hence, 
from Definition 2.1, we have 


fos fo= f emmera 


1 
=f {(5t — 2)-14+1-3+(3t—1)-5}dt 
0 


l 23 
=] (23t —7)dt = (e-r) 
4 2 


Note that if we parametrize C in the opposite direction by using, for example, 
the map 


1 


9 
0 2 


y:[0,1] > R?, y()=#(0, -1, -2)+ 1 — A, 2, 3)=(1 =t, 2 — 34, 3 — 5t), 
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then we would have 


Í = i EAO. 


1 
= : (G = 5-1) + = 3) + 2 = 31-5) dt 


i 2 
=f (23t — 16)dt = (Fe- 16) 
i 2 


In light of Theorem 2.11, this result could have been anticipated from our pre- 
ceding calculation of f w. ® 


1 


0 


Note on k-manifolds 


The central geometric object of study in this section, namely, a parametrized 
k-manifold, is actually a rather special case of a more general notion of a k- 
manifold. In general, a k-manifold in R” is a connected subset M C R” such that, 
for every point x € M, there is an open set U C R* and a continuous, one-one 
map X: U — R” with x € X(U) C M. (A k-manifold with nonempty boundary 
requires a somewhat modified definition.) That is, M is a (general) k-manifold 
if it is locally a parametrized k-manifold near each point. It is possible to extend 
notions of orientation and integration of k-forms to this more general setting, 
although it requires some finesse to do so. For the types of examples we are 
encountering, however, our more restrictive definitions suffice. 


1. 


2. 


Check that the parametrized 3-manifold in Example 3 is in 
fact a smooth parametrized 3-manifold. 


A planar robot arm is constructed by using two rods as 
shown in Figure 8.6. Suppose that each of the two rods may 
telescope, that is, that their respective lengths /; and l2 may 
vary between 1 and 3 units. Show that the set of states of 
this robot arm may be described by a smooth parametrized 
4-manifold in R4. 


y 


(x2, y2) \ 


(1, y1) 


Figure 8.6 Figure for Exercise 2. 


. A planar robot arm is constructed by using a rod of length 


3 anchored at the origin and two telescoping rods whose 
respective lengths l and l, may vary between 1 and 2 units 
as shown in Figure 8.7. Show that the set of states of this 


robot arm may be described by a smooth parametrized 5- 
manifold in R°. (See Exercise 2.) 


y Py (x3, y3) 


(x, y1) 


Figure 8.7 Figure for Exercise 3. 


. A robot arm is constructed in R? by anchoring a rod 


of length 2 to the origin (using a ball joint so that 
the rod may swivel freely) and attaching to the free 
end of the rod another rod of length 1 (which may 
also swivel freely; see Figure 8.8). Show that the set of 
states of this robot arm may be described by a smooth 
parametrized 4-manifold in R°. 


. Suppose vı, .. . , Vg are vectors in R”. If x € R” is orthog- 


onal to v; fori = 1,...,k, show that x is also orthogonal 
to any vector in Span{v),..., Vg}. 
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(X2, Yo, Z2) 


x 


Figure 8.8 Figure for Exercise 4. 


. Leta, b, and be positive constants and x: [0, 7] —> R? 


the smooth path given by x(t) = (a cost, b sint, ct). If 
w = bdx — a dy + xy dz, calculate f, w. 


. Evaluate JS c %2, Where C is the unit circle x? + y? =; 


oriented counterclockwise, and w = y dx — x dy. 


. Compute Je w, where C is the line segment in R” 


from (0,0,...,0) to (3,3,...,3) and w = xı dxı + 
x3 dx +-+- + x” dx. 


. Evaluate the integral f, x %» Where X is the parametrized 


helicoid 
X(s,f)=(s cost,s sint,t) O<s<1,0<t<47 


and 


w= zdx Ady+3dzA dx —xdy ^ dz. 


Consider the helicoid parametrized as 


X(u1, U2) = (u; cos 3u2, uy sin 3u2, 5u2), 


0<u,; < 5,0 <u < 27. 


Let S denote the underlying surface of the helicoid and 
let Q be the orientation 2-form defined in terms of X 
as 
—S5sin3u. a, by 
QX(u),u2) (A; b) = det 5cos3u? a bo 


—3u, a3 b3 


(a) Explain why the parametrization X is incompatible 


with Q. 


(b) Modify the parametrization X to one having the 


same underlying surface S but that is compatible 
with Q. 


(c) Alternatively, modify the orientation 2-form Q to Q’ 


so that the original parametrization X is compatible 
with Q’. 


11. 


12. 


13. 


14. 


15. 


(d) Calculate f5@, where w=zdx Ady —(x? + 
y?) dy A dz and S is oriented using Q. 


Let M be the subset of R? given by {(x, y, z) | x? + 
y?—-6<z2<4-—x?-—y’}. Then M may be 
parametrized as a 3-manifold via 


X: D—>R?:; X(u;, u2, u3)= (Uu; COS u2, Uy SiN U2, U3), 

where 

D={(u1, u2, u3) € R? | 0 < ui < V5, 0< u < 27, 
u? — 6 < us <4-u?}. 


(The parameters u1, u2, and u3 correspond, respec- 
tively, to the cylindrical coordinates r, 6, and z. Hence, 
it is straightforward to obtain the aforementioned 
parametrization.) 


(a) Orient M by using the 3-form Q, where 
Qxw(a,b,e)=det[a b c]. 
Show that the parametrization, when smooth, is 
compatible with this orientation. 


(b) Identify 0M and parametrize it as a union of two 
2-manifolds (i.e., as a piecewise smooth surface). 


(c) Describe the outward-pointing unit vector V, vary- 
ing continuously along each smooth piece of 0M, 
that is normal to 0M. Give formulas for it in terms 
of the parametrizations used in part (b). 

Calculate f, œ, where S is the portion of the paraboloid 

z=x?+y? with 0 <z<4, oriented by upward- 

pointing normal vector (—2x,—2y,1), and w= 
edx Ndy+ydzAdx+xdy Adz. 

Calculate J s% where S is the portion of the cylinder 

x? +z? = 4 with —1 < y < 3, oriented by outward 

normal vector (x, 0, z),andw = z dx Ady +e” dz ^ 

dx + x dy ^ dz. 

Consider the parametrized 2-manifold 

X: [1,3] x [0,27) > Rf, X(s,t) 
= (Vs cost, V4 — s sint, v/s sint, /4 — s cost). 


Find 


Í (x3 + x7) dx, ^ dx3 — (2x7 + 2x3) dx A dx4. 
x 
Consider the parametrized 3-manifold 
X:[0, 1] x [0, 1] x [0, 1] > R4, 
X(u1, u2, U3) = (u1, u2, u3, (2u1 — u3)°). 
Find 


f x2 dx2 A dx3 A dx4 + 2x1x3 dxı A dx2 A dx3. 
x 
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8.3 The Generalized Stokes’s Theorem 


We conclude with a discussion of a generalization of Stokes’s theorem that relates 
the integral of a k-form over a k-manifold to the integral of a (k — 1)-form over 
the boundary of the manifold. Before we may state the result, however, we need 
to introduce the notion of the exterior derivative of a k-form. 


The Exterior Derivative - 


The exterior derivative is an operator, denoted d, that takes differential k-forms 
to (k + 1)-forms and is defined as follows: 


DEFINITION 3.1 The exterior derivative df of a 0-form f on U C R” is 
the 1-form 
af af 


of 
ay = dx, 4 dx»+---4 On 
f Ox] si 0X2 a OG, = 


For k > 0, the exterior derivative of a k-form 


w = ť T Ie AAN br, 


is the (k + 1)-form 
do =) (dFy..,) A Axi, Ao A dx, 


where d F; i, 1s computed as the exterior derivative of a 0-form. 


EXAMPLE 1 If 
f (X1, X2, X3, X4, X5, X6) = X1X2X3 + X4X5X6, 


then 


df = x2x3 dxı + X1X3 dX + x1xX2 dx3 + xs5Xx6 dx4 + x4x6 dxs + x4x5 dxe. ¢® 
EXAMPLE 2 Ifo is the 1-form 
œ = xx dx, + x2x3 dx2 + (2x1 — x2) d x3, 
then 
dæ = d(xıx2) A dx, + d(x2x3) A dx2 + d(2x1ı — x2) A dx3 
= (x2 dx, + xı dx2) A dx, + (x3 dx2 + x2 dx3) A dx2 + (2dxı— dx2) A dx3. 


Using the distributivity property in Proposition 1.4 and the facts that dx; A dx; = 0 
and dx; ^A dx; = —dx; A dxi, we have 
dw = xı dx) A dx, + x2 dx3 A dx. + 2 dxı Adx3 — dx A dx3 


= —xı dx; A dx. + 2dxı A dx3 — (x. + l)dx2 A dx3. + 


Stokes’s Theorem for k-forms 


We now can state a generalization of Stokes’s theorem to smooth parametrized 
k-manifolds in R”. 


THEOREM 3.2 (GENERALIZED STOKES’S THEOREM) Let D C RÝ be a closed, 
bounded, connected region, and let M = X(D) be an oriented, parametrized k- 
manifold in R”. If dM # Ø, let ƏM be given the orientation induced from that 
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of M. Let w denote a (k — 1)-form defined on an open set in R” that contains M. 


Then 
[ a= w. 
M aM 


If OM = Ø, then we take f au ® to be 0 in the preceding equation. 


We make no attempt to prove Theorem 3.2.! Instead, we content ourselves 
for the moment by checking its correctness in a particular instance. 


EXAMPLE 3 We verify the generalized Stokes’s theorem (Theorem 3.2) for the 
2-form w = zw dx A dy, where M is the 3-manifold M = {(x, y, z, w) € Rf | 
w = x? + y? +z, x? + y? +z? < 1} oriented by the 3-form Q corresponding 


to the unit normal 


7 (2x, 2y, 2z, —1) 
o 4x2 + 4y?2 442241 


The manifold M is a portion of the 3-manifold given in Example 7 of §8.2 
and may be parametrized as 


X:B > R, X(u,, uz, u3) = (u1, u2, U3, U? + u3 + u3), 


where B = {(u1, u2, u3) | u? + us + u? < 1}. Using this parametrization, we 
have 
Tu, = (1; 0, 0, 2u) 


T„, = (0, 1, 0, 2u2) 
Ta, = (0, 0, 1, 2u3) 

(2u1, 2u2, 2u3, —1) 
VEETETTENT 


N= 


so the orientation 3-form Q is given by 
xw (ar, a, a3) = det [N a; a2 az]. 


Example 7 of §8.2 shows that the parametrization X is compatible with this 
orientation. Hence, we may use this parametrization without any adjustments 
when we calculate fy} da. 

The boundary of MisdM = {(x, y, z, w) | x? + y? + z? = w = l}andmay 
be parametrized as 


Y:[0, 2] x [0, 277) > R*, Y(sq, s2) = (sin sı cos s2, sin sı sin s2, cos sy, 1). 


Then 
T;, = (cos sı COS s2, COS s1 Sin s2, — sin s1, 0) 
and 


Ts, = (— sin sı sin s2, sin sı cos s2, 0, 0). 


' For a full and rigorous discussion of differential forms and the generalized Stokes’s theorem, see J. R. 
Munkres, Analysis on Manifolds (Addison-Wesley, 1991), Chapters 6 and 7. 
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An outward-pointing unit vector V = (v1, v2, v3, v4) tangent to M and normal to 
ƏM must satisfy 
e V.N = 0 along 0M; 
e V.T, =V-T,, =0. 
Along 0M, we have 


1 
N = — (2 sin sı cos s2, 2 sin sı sin s2, 2 cos s1, — 1). 
V5 


Thus, V must satisfy the system of equations 
(2 sin sı cos s2)v; + (2 sin sı sin s2)v2 + (2 cos sı )v3 — v4 = 0 


(cos sı cos $2)vı + (cos sı sin s2)v2 — (sin sı)v3 = 0 


—(sin sı sin s2)vı + (sin sı cos s2)v2 = 0 
After some manipulation, one finds that the unit vector that satisfies these equa- 
tions and also points away from M is 


Kass f : 
V = — (sin sı COS $2, SiN S1 SIN S2, COS $1, 2). 


V5 
Then the induced orientation 2-form Q°” for dM is given by 
2H (ai, a2) = x(V, a1, a2), 
where X(u) = Y(s). In particular, we have 


M (Ta, To) = det[N V T, T,] 


2 . 1 . . P 
a L s s —sin sı sin 
F sinsı Coss, Fz sinsi Coss, Coss; Coss; —sin sı sins, 
= sin sı sin s2 5 sin sı Sins, Coss; sins, sins, COs s2 
= det ‘ i , j 
= L —sin 
Jz OSS) F COS S sin sı 
=a 2 | 
2 i 0 0 
= sins; > 0 


for 0 < sı < x. Hence, the parametrization Y of dM, when smooth, is compatible 
with the induced orientation, so we may use this parametrization to calculate 


fiyo 
aM 
Now we are ready to integrate. We first compute f m a. Since w = zw dx A 


dy, we have 
dw = d(zw) A dx A dy = (zdw + wdz) A dx A dy 


=zdwAdx Ady +wdzAdx Ady. 
Thus, 


/ aw = [ff daoxu)(Tu,,Tu,,Tu;) dui duz dug 
M B 
af {u3dw A dx A dy(Tu,, Tu, Tus) 
B 


+ (u? + u2 + U3) dzAdx Ady(Ty,, Ty; T,,)} du, du du; 
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2uy 2u  2u3 

z i J / ait o o 
B 0 1 0 
pe! 


=2u3 
0 0 1 
+u? + u? + u?) 1 0 Of} du;dudu3 
0 1 0 


>f] (u? +u? + 3u3) du; du duz. 
B 


Since B is a solid unit ball, the easiest way to evaluate this iterated integral is to 
use spherical coordinates p, gy, and 0. Hence, 


20 m 1 
/ dw = / Í / (o + 20? cos? 9) p sing dp dy d0 
M o Jo Jo 


27 x 1 
= i 1 | pî (sing + 2 cos? g sing) dp do dé 
o Jo Jo 
27 x] 
= | l z (sing +2 cos? psing) dg d0 
o Jo 


1 [ 2 ‘ T 
= cos cos 
5 J a eds 


1 (710 4 
= 1 do = —. 
Se 3 3 


dé 


g=0 


On the other hand, 


/ o= ff wys)(Ts,, Ts,) ds1 dso 
aM [0,7 ]x[0,27) 


20 m 
= i Í cos sı dx A dy(Ts,, Ts,) ds) ds2 
0 0 


2x T 
= / | COS S1 
0 0 


27 m 
= / f cos sı (cos sı sin s1) ds; ds2 
0 Jo 


IE 
-| if cos? sı sin sı ds; ds> 
0 0 
ms 2x 2 4r 
ds) = Í = ds» = —. 
sı=0 0 3 


2x 
2r 1 
= (500551) l 3 


Therefore, the generalized Stokes’s theorem is verified in this case. + 


COS S] COSS2 — sin sı sin s2 
F : ds, ds 
cos s;Sinsz sin Ss, COS $2 
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Besides being notationally elegant, the integral formula in Theorem 3.2 beau- 
tifully encompasses all three of the major results of vector analysis, as we now 
show. 

First, let œ be a 1-form defined on an open set U in R?. Then 


w= M(x, y)dx + N(x, y)dy, 
so that 


dw =dM Adx+dN Ady 


aM aM aN aN 
= (ar May) nav ( dx 4 dy) dy 
y 


Ox Ox dy 
ƏN dM 

= | — — — }dx Ady. 
Ox dy 


The generalized Stokes’s theorem (Theorem 3.2) says that if D is a 2-manifold 
contained in U and 0D is given the induced orientation (see Figure 8.9), then 


J dw = J w, 
D aD 
or, in this instance, that 


dN OM 
Tf (Z — M) axdy =f Mdx + Nady, 
p \ 0x dy aD 


which is Green’s theorem. 


y y 
aD 

xX X 
f,do=Jf (2 = aM) dx dy Jp =$,,M dx + N dy 


Figure 8.9 The generalized Stokes’s theorem implies Green’s theorem. 


Next, suppose w is a 1-form defined on an open set U in R?. Then 
w = Fi(x, y,z)dx + Fy(x, y, z)dy + F3(x, y, z) dz. 
It follows that 


OF; OF: OF, OF 
do = | = - 2 dy Adz + 2 3) az a dx 
dy Oz 0z ax 


OF OF 
(2 = a) ax Ady. 
Ox dy 
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Recall from Proposition 2.2 that if S is a parametrized 2-manifold (surface in R3), 


then 
/ o=§ F -ds, 
as as 


where F = Fi, i+ F)j+ F; k. From Proposition 2.4, 


[a= | [e-as, 
s s 
where 


OF; OF OF, OF OF, OF 
G=(— 2 \i4 z $ he = 1\)k=VxF. 
dy Oz Oz Ox ox dy 
Theorem 3.2 tells us, if S is oriented and 0S is given the induced orientation, that 


J o= f do, 

as S 

f F-ds= | f V xF-as, 
as Ss 


which is the classical Stokes’s theorem. (See Figure 8.10.) 


or, equivalently, that 


J,do=JJ,VxF-ds fso =,5F - ds 


Figure 8.10 The generalized Stokes’s theorem gives the classical Stokes’s 
theorem. 


Finally, let w be a 2-form defined on an open set in RÌ. So 
w = F(x, y,z)dy ^A dz + Fy(x, y,z)dz ^ dx + F3(x, y, z)dx A dy. 


You can check that 


do = 


= H H =) dx A dy A dz. 
Ox dy Oz 

If D is a region in R, then D is automatically a parametrized 3-manifold, since 
the map X: D —> R$, X(x, y, z) = (x, y, z) parametrizes D. (One can show that 
in this instance D is always orientable as well.) If D is bounded and 0 D (which is 
a surface) is given the induced orientation (i.e., outward-pointing normal), then 


w 


w = Fi dx + Fz dy + F; dz Vector field 


@ = Fı dy Adz + Fdz ^Adx + F; dx A dy Vector field 
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| o= f F-dS, 
aD aD 


where F = Fi i+ F)j+ F; k. From Example 6 of 88.2, 


dF, aF) OF 
[ao= f( Pe Vax adyndc= fff v-rav. 
D D Ox dy az D 


Theorem 3.2 indicates that f, p œ = fp dw or 


fp r-as= fff v-rav, 


which is, of course, Gauss’s theorem. (See Figure 8.11.) 


Proposition 2.4 states that 


Z 


Y 


Xx Xx 
J,do = fj], V: F av [,,0 = pE as 


Figure 8.11 The generalized Stokes’s theorem gives 
rise to Gauss’s theorem. 


In the foregoing remarks, we have implicitly set up a sort of “dictionary” 
between the language of differential forms and exterior derivatives and that of 
scalar and vector fields. To be explicit, see the table of correspondences shown in 
Figure 8.12. 

The theorems of Green, Stokes, and Gauss all arise from Theorem 3.2 
applied to 1-forms and 2-forms. The next question is, can the “dictionary” and 
Theorem 3.2 provide a corresponding result for 0-forms? The generalized Stokes’s 
theorem (Theorem 3.2) states, for a 0-form w and an oriented parametrized curve 


C, that 
[w= w. 
Č ac 


Differential k-form Derivative 


Scalar field f 


F= Fhi+ P)j+hk 


F= Fihi+ P)j+hk 


Figure 8.12 A differential forms—vector fields dictionary. 
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C +eB 
-¢A 


Figure 8.13 The orientation 
of the curve C induces an 
orientation of its boundary 
(i.e., the endpoints A and B). 


8.3 Exercises 


In Exercises 1—7, determine dw, where w is as indicated. 


10=e™”% 


. w= xy —2xz? + xyz 


. œ = (x? + y’)dx + xy dy 
@ = xı dx — x2 dx; + x3Xx4 dX4 — X4X5 d x5 


. œ = xzdx ^ dy — y*zdx Adz 


Now, if C is closed, then 3C is empty (and so f, c © = 0). But if C is not closed, 
then ƏC consists of just two points. In that case, what should he @ mean? In 
particular, to apply Theorem 3.2, we must orient dC in a manner that is consistent 
with the orientation of C, which can be done by assigning a “—” sign to the initial 
point A of C and a “+” sign to the terminal point B. (See Figure 8.13.) Then 
Í. ac @ is just f (B) — f(A), where f is the function (scalar field) corresponding 
to w in the table. Since dw corresponds to V f, Theorem 3.2 tells us that 


[ Vf-ds= f(B)— f(A), (1) 


the result of Theorem 3.3 in Chapter 6. 

Finally, for the case n = 1, that is, the case of 0-forms (functions) on R, the 
0-form w corresponds to a function f of a single variable, and V f is the ordinary 
derivative f’. Furthermore, a parametrized curve in R is simply a closed interval 
[a, b]. Then equation (1) reduces to 


b 

f(x) dx = f(b) — f(a), 
a version of the fundamental theorem of calculus. Thus, we can appreciate that 
the generalized Stokes’s theorem is an elegant and powerful generalization of the 
fundamental theorem of calculus to arbitrary dimensions. 


11. Verify the generalized Stokes’s theorem (Theorem 3.2) 
for the 3-manifold 


M={(x, y,z, w) € R*|x=8 — Zy — 2z — 2w, x > 0} 


and the 2-form w = xy dz A dw. (Hint: First compute 
fim œ- To calculate fy dw, study Example 3 of this 
section.) 


12. (a) Let M be a parametrized 3-manifold in R? (i.e., a 
solid). Show that 


2 
3 
4. 
5 
6 


10. 


© @ = X1X2xX3 dx A dx3 A dx4 + X2x3X4dx1 A dX ^ 


dx3 


; ae = 
. w=}; xP dxi A+++ A dx; A+++ A dx, (Note: dx; 


means that the term dx; is omitted.) 


Let u be a unit vector and f a differentiable function. 
Show that dfx, (u) = Du f (Xo). (Recall that Du f (X0) 
denotes the directional derivative of f at xq in the di- 
rection of u.) 


Ifw = F(x, z)dy + G(x, y)dz is a (differentiable) 1- 
form on R?, what can F and G be so that dw = 
zdx Ady+ydx Adz? 


Verify the generalized Stokes’s theorem (Theorem 3.2) 
for the 3-manifold M of Exercise 11 of §8.2, where 
w = 2xdy A dz — z dx Ady. 


1 
Volume of = 5 | xdy A\dz—ydx Adz 
3 Jam 


+zdx Ady. 


(b) Let M be a parametrized n-manifold in R”. Ex- 
plain why we should have 


n-dimensional volume of M 


1 
=: f xı dx2 ^+- A dXy 
aM 


n 
— x2 dx; Adx3 A+++ A dXn 
+4x3dx,; A dx. \dx4A+++AdXn+-:: 


+(—1)" lx, dx, A dx A+++ A dXp_1. 


True/False Exercises for Chapter 8 


. (dx Ady + dy A dz)((1, 0, 1), (0, —1, 3)) = 0. 

. dx; A dx A dx3 ^ dx4 = dx A dx4 A dx, A dx3. 
. There are 21 basic 5-forms in R’. 

. Ax, A dx = dx A dx. 


. (dx, A dx2) A dx3 = dx3 A (dx; A dx2). 


~on fF UU N = 


. If @ is a 3-form on R and n is a 5-form on R6, then 
OAN=NA®. 


7. If% is a 2-form on RË and n is a 3-form on RË, then 
OAN=NA®. 


8. dx Ady A dz(a, b, c) = —dz A dy A dx(a, ¢, b). 
9. dx; A dx (a, b) = —dx; A dx j(b, a). 


10. Let D = [0, 2] x [—1, 1] and let X: D —> R be given 
by 


X(s, t) = (s —¢, st, se’, 4t). 


Then M = X(D) is a smooth parametrized 2-manifold 


in R. 
11. Let D = [—2, 2] x [0,5] x [-3, 3] and let X: D > 
Rt be given by 


2 ,2 3.4 
X(u;, U2, U3) = (Uj U3, UZ COSU3, Uy — U2, U3U3). 


Then M = X(D) is a smooth parametrized 3-manifold 


in R4. 
12. If D = [0, 1] x [0, 1], then the underlying manifolds 
of X: D > R3, 
X(s, t) = (s cos 27t, s sin2zt, s°) 
and Y: D > R?, 


Y(s, t) = (t cos2xs, tsin2z7s, j 
are the same. 


13. Let w=dx^dy and D= [0, 1] x [0, 1]. 
fxe = fy ©, where X: D > R, 


Then 


X(s, t) = (s cos 27t, s sin2zt, 8’), 


Miscellaneous Exercises for Chapter 8 


1. Let œ be a k-form, 7 an /-form. Show that 
d(w an) = dw An+(-1)'a A dn. 


This is accomplished by the following steps: 

(a) Show that the result is true when k = / = 0, that is, 
when w = f and 7 = g. (Here f and g are scalar- 
valued functions.) 


14. 


15. 


16. 


17. 


18. 


19. 
20. 


. Let 
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and Y: D > R’, 
Y(s, t) = (t cos2xs, tsin2zs, tj: 


Let B={uc R | us +u? + uy < 1}. The general- 
ized paraboloid X: B —> Rĉ defined by 


X(u1, u2, U3) = (U1, U2, U3, U + 2u5 + 3u3) 


has as its boundary the ellipsoid Y: [0, m] x [0, 277) —> 
RÍ, 

L sinssint, 4 
> 2 A3 
Let M C R” be the graph of a function f:U C 
R""! — R” parametrized by X: U > R”, 


<» Un—1, fu, . 


Y(s, t) = (sins cost coss, 1). 


X(u4,.. »,Un—1))- 


on Un—1) = (u1, . 
If 
us -es funa TI) 


N(u;, pp 
T +1 


-s Un 1) = 


is a unit normal, then the parametrization X is compat- 
ible with the (n — 1)-form Q defined by 


Ĉxu)(a1, (hae) det [ a) an-ı N ] 7 


If w= xix; dx Adx4, then dw = x3 dx; A dx. A 
dx4 + xı dx. A dx3 A dx4. 


If @ = xı dx3 — x2 dx; + x1x2x3 dx3, then 
dw = (x2x3 + 1) dx; A dx3 + dx, A dx 
+x1xX3 dx A dx3. 
If œw = xix: dx, A dx2 + x2x3 dx, A dx3 + x1x3 dx2 
A dx3, then 
dw = 2x; dxı A dx2 A dx3. 
If w is an n-form on R”, then dw = 0. 


If M is a parametrized k-manifold without boundary 
in R” and w is (k — 1)-form defined on an open set 
containing M, then fy dw = 0. 


(b) Establish the result when k = 0 and/ > 0. 
(c) Establish the result when k > 0 and/ = 0. 
(d) Establish the result when k and l are both positive. 


M be the subset of R 
{(x1, x2, X3, X4, X5) | X5 = X1X2X3X4, 


described as 
0 < x1, X2, X3, 


X4 < 1}. 
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(a) Give a parametrization for M (as a 4-manifold) 
and check that your parametrization is compati- 
ble with the orientation 4-form Q = dx, A dx2 A 
dx3 \ dx4. 


(b) Calculate te x4 dx, A dx. A dx3 A dxs. 


(a) Let C be the curve in R? given by y = f(x), 
a<x <b. Assume that f is of class C!. If C 
is oriented by the direction in which x increases, 
show that if œ = y dx, then 


/ œ = area under the graph of f. 
c 


(b) Let S be the surface in R? given by the equation z = 
f(x,y), where (x, y) € [a, b] x [c,d]. Assume 
that f is of class C!. If S is oriented by upward- 
pointing normal, show that ifw = z dx A dy, then 


Í w = volume under the graph of f. 
s 


(c) Now we generalize parts (a) and (b) as follows: 
Suppose f: D — R is a function of class C! de- 
fined on a connected region D C R"~!. Let M be 
the (n — 1)-dimensional hypersurface in R” de- 
fined by the equation x, = f(x1,..., Xn—1), where 
(%1,---,;Xn-1) € D. Iw = dadi A+++ A dxn-1, 
show that 


i @ = +(n-dimensional volume 
M 
under the graph of f). 


How can we guarantee a “+” sign in the equation? 


. Let M be the portion of the cylinder x? +z? = 1, 


0 < y <3, oriented by unit normal N = (x, 0, z). 

(a) Use N to give an orientation 2-form Q for M. Find 
a parametrization for M compatible with Q. 

(b) Identify 0M and parametrize it. 

(c) Determine the orientation form Q?” for ƏM in- 
duced from Q of part (a). 


(d) Verify the generalized Stokes’s theorem (Theorem 
3.2) for M and w = zdx + (x + y + 2z)dy—x dz. 


. Use the generalized Stokes’s theorem to calcu- 


late fa œw, where St denotes the unit 4-sphere 
{(x1, x2, X3, x4, X5) € RÍ | yr + x3 +x} +x} +x2 = 
1} and w=x3dx, Adx. A dx4 A dx5+x4dx, A 
dx \ dx3 A dxs. 


. (a) Let w be a 0-form (i.e., a function) of class C?. 


Show that d(dw) = 0. 
(b) Now suppose that is a k-form of class C?, mean- 
ing that when w is written as 


a Fi, i, AXj, A+++ A Xi, 


1 <i) <++-<ip<n 


10. 


each F;, i, is of class C°. Use part (a) and the result 
of Exercise 1 to show that d(dw) = 0. 


. In this problem, show that the equation d(dw) = 0 im- 


plies two well-known results about scalar and vector 
fields. 


(a) First, let w be a 0-form (of class C”). Then w corre- 
sponds to a scalar field f. Use the chart on page 559 
to interpret the equation d(dw) = 0. 


(b) Next, suppose that w is a 1-form (again of class 
C*). Then w corresponds to a vector field. Inter- 
pret the equation d(dw) = 0 in this case. 

Let 

xdy \dz+ydzAdx+zdx Ady 
C2+y24+ 27 


(a) Evaluate f, œ, where S is the unit sphere x? + 
y? +z? = 1, oriented by outward normal. 
(b) Calculate dw. 


Figure 8.14 Figure for Exercise 8. 


(c) Verify Theorem 3.2 over the region M= 
{(x, y, z) | a? < x? +y? +2? < 1}, wherea Æ 0. 

(d) Now let M be the solid unit ball x? + y? +z? < 1. 
Does Theorem 3.2 hold for M and w? Why or why 
not? 

(e) Suppose that S is any closed, bounded surface that 
lies entirely outside the sphere Se = {(x, y, z) | 
x? + y? + 2? = e°}. (See Figure 8.14.) Argue that if 
S is oriented by outward normal, then f, œ = 47. 


. Let M be an oriented (k + L + 1)-manifold in R”; let 


w be a k-form and 7 an /-form defined on an open set 
of R” that contains M. If JM = Ø, use Theorem 3.2 
and Exercise | to show that 


f aonn= f w^ dn. 
M M 


Let M be an oriented k-manifold. Use Exercise 1 and 
the general version of Stokes’s theorem to establish 
“integration by parts” for k-forms w and 0-forms f: 


[ fa = | fo- f afro. 


Suggestions for Further 


Reading 


General 


Francis J. Flanigan and Jerry L. Kazdan, Calculus Two: Linear 
and Nonlinear Functions, 2nd ed., Springer, 1990. The essen- 
tials of vector calculus are presented from a linear-algebraic 
perspective. 


John H. Hubbard and Barbara Burke Hubbard, Vector Calculus, 
Linear Algebra, and Differential Forms: A Unified Approach, 
2nd ed., Prentice Hall, 2002. Treats the main topics of multi- 
variable calculus, plus a significant amount of linear algebra. 
More sophisticated in approach than the current book, using 
differential forms to treat integration. 


Jerrold E. Marsden and Anthony J. Tromba, Vector Calculus, 
5th ed., W. H. Freeman, 2003. This text is probably the one most 
similar to the current book in both coverage and approach, 
using matrices and vectors to treat multivariable calculus 
in R”. 


Jerrold E. Marsden, Anthony J. Tromba, and Alan Weinstein, 
Basic Multivariable Calculus, Springer/W. H. Freeman, 1993. 
Somewhat similar to Marsden and Tromba’s Vector Calculus, 
but with less emphasis on a rigorous development of the sub- 
ject. A good guide to the main ideas. 


George B. Thomas and Ross L. Finney, Calculus and Ana- 
lytic Geometry, 9th ed., Addison-Wesley, 1996. A complete 
treatment of the techniques of both single-variable and multi- 
variable calculus. No linear algebra needed. A good reference 
for the main methods of vector calculus of functions of two 
and three variables. 


Richard E. Williamson, Richard H. Crowell, and Hale F. 
Trotter, Calculus of Vector Functions, 3rd ed., Prentice Hall, 
1972. A smooth and careful treatment of multivariable calculus 
and vector analysis, using linear algebra. 


More Advanced Treatments 


The following texts all offer relatively rigorous and theoretical 
developments of the main results of multivariable calculus. As 
such, they are especially useful for studying the foundations of 
the subject. 


Tom M. Apostol, Mathematical Analysis: A Modern Approach 
to Advanced Calculus, 1st ed., Addison-Wesley, 1957. Look at 
the first edition, since the second edition contains much less 
regarding multivariable topics. 


R. Creighton Buck, Advanced Calculus, 3rd ed., McGraw-Hill, 
1978. Treats foundational issues in both single-variable and 
multivariable calculus. Uses the notation of differential forms 
for considering Green’s, Stokes’s, and Gauss’s theorems. 


Richard Courant and Fritz John, Introduction to Calculus and 
Analysis, Vol. Two, Wiley-Interscience, 1974. A famous and 
encyclopedic work on the analysis of functions of more than 
one variable (Volume One treats functions of a single variable), 
with fascinating examples. 


Wilfred Kaplan, Advanced Calculus, 3rd ed., Addison-Wesley, 
1984. A full treatment of advanced calculus of functions of two 
and three variables, plus material on calculus of functions of n 
variables (including some discussion of tensors). In addition, 
there are chapters on infinite series, differential equations, and 
functions of a complex variable. 


O. D. Kellogg, Foundations of Potential Theory, originally pub- 
lished by Springer, 1929. Reprinted by Dover Publications, 
1954. A classic work that ventures well beyond the subject of 
the current book. The writing style may seem somewhat old- 
fashioned, but Kellogg includes details of certain arguments 
that are difficult to find anywhere else. 


James R. Munkres, Analysis on Manifolds, Addison-Wesley, 
1991. A superbly well written and sophisticated treatment of 
calculus in R”. Requires a knowledge of linear algebra. In- 
cludes a full development of differential forms and exterior al- 
gebra to treat integration. For advanced mathematics students. 


David V. Widder, Advanced Calculus, 2nd ed., originally pub- 
lished by Prentice-Hall, 1961. Reprinted by Dover Publica- 
tions, 1989. Careful treatment of differentiation and integra- 
tion of functions of one and several variables. Chapters on 
differential geometry, too. 


Physics Oriented Texts 


Mary L. Boas, Mathematical Methods in the Physical Sciences, 
2nd ed., Wiley, 1983. A wide variety of topics that extend well 
beyond vector calculus. For the student interested in physics as 
well as mathematics. 


Harry F. Davis and Arthur D. Snider, Introduction to Vector 
Analysis, 6th ed., William C. Brown, 1991. A detailed and rel- 
atively sophisticated treatment of multivariable topics. Includes 
appendices on classical mechanics and electromagnetism. 
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Suggestions for Further Reading 


Edward M. Purcell, Electricity and Magnetism, 2nd ed., 
McGraw-Hill, 1985. This is a physics, not a mathematics, text. 
Provides excellent intuition regarding the meaning of line and 
surface integrals, differential operations on vector and scalar 
fields, and the significance of vector analysis. 


Other 


Alfred Gray, Modern Differential Geometry of Curves and Sur- 


faces with Mathematica®, 2nd ed., CRC Press, 1998. Not 
a vector calculus text by any means, but rather a delightful 


library of geometric objects and how to understand them via 
Mathematica. Some of the differential geometric topics are 
somewhat remote from the subject of the current book, but the 
many illustrations are worth viewing. An outstanding aid for 
developing one’s visualization skills. 


H. M. Schey, Div, Grad, Curl, and All That: An Informal Text 
on Vector Calculus, 3rd ed., W. W. Norton, 1997. The classic 
“alternative” book on vector analysis, aimed at students of elec- 
tricity and magnetism. Brief, but well done, intuitive account 
of vector analysis. 


Answers to Selected 
Exercises 


Chapter 1 


Section 1.1 


a) y 


(2,1) 


O y 
(3,3) 


(c) y 


(-1, 2) 


(b) (—16, —24) 
(c) (5, 15) 

(d) (—19, 25) 
(e) (8, —26) 


b (translated) 


Te 


9: 
T1; 
T3; 


15: 
17. 


21. 


23. 


25: 


21: 


— — 
(a) AB = (—4, 3, —1), BA = (4, —3, 1) 

— — —> — 
(b) AC =(1,1,3), BC =(5,-2,4, AC+CB= 


(—4, 3, —1) 
(c) In general, we have 


x=14,y=16,z=8 

Because b = Sa. 

(1, 2,3,4)+ (5, —1, 2, 0) = (6, 1, 5, 4) 

2(7, 6, —3, 1) = (14, 12, —6, 2) 

a+ b = (ai + bi, ..., an + bn); ka = (kay,..., kan) 

— 

Pi P, = (1, —4, —1, 1) 

If your displacement vector is a and your friend’s is b, then 

b — a is the displacement vector from you to your friend. 

(b) The position vectors of points in the parallelogram 
determined by (2, 2, 1) and (0, 3, 2) 

(a) /5 units per minute 

(b) (0, —4) 

(c) 7 minutes 

(d) No 

(a) (5,5, 4) 

(b) (—5, —5, —4) 

F; = (4, —4, 2) and F, = (—4, 4, 8) 


Section 1.2 


fi: 
5. 


2i+ 4j 3. 3i+- xj — 7k 
2i+ 4j 7. (9, —2, V2) 
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9. 
11. 


13. 
15. 
17. 
19. 
21. 


23: 
25. 
27. 


29. 


3l: 


33. 
35. 


37. 
45. 


47. 


Answers to Selected Exercises 


(x, —1) 

(a) b = 2a; +a) 

(b) b = — la; + 4a) 

(c) Take Cı = (bi + b2)/2, C2 = (bi = by) /2. 

x=t+2,y=3t-—1,z=5-6t 

x=t+2,y=-T7t-1 

x=t+ly=4,z=5-6t 

xy = l — 2t, x2 = 5t + 2, x3 = 3t,x4 = 7t+4 

(a) x =2t-—l,y=7-t,z=5t+3 

(b) x =5-— 5t, y =4t-—3,z=5t+4 

(c) One alternative for (a): x=—2t—1,y=t+7, 
z = 3 — 5t; one alternative for (b): x = 5t, y = 1 — 
4t,z=9-—5t 

(d) For (a): œ + 1)/2 = (y —7)/(-1) = & — 3)/5; 
for (b): (x — 5)/(—5) = (Y + 3)/4 = @ — 4)/5; 


for (e): (& + 1I)/(—2) = O = D/1 = & — 3)/(-S) 
and x/5 = (y — 1)/(-4) = k — 9)/(-5). 
(x = 7)/1 = ( + 9)/3 = (z — 6)/(-8) 
x=3t-—S,y=7t+1,z=-—2r-10 


Multiply the first symmetric form by — 5. Then add —5 
to each “side.” 

No. Setting ¢ = 0 in the equations for lı gives the point 
(2, —7, 1). To have x = 2 in h, we must take t = L, But 
t= 5 gives the point (2, —7, 2) on l2, not (2, —7, 1). 

No, they do not. Note that x > —1, y > 3, and z < 1 


only. The parametric equations define a ray with endpoint 
(-1, 3, 1). 


(=14, 5, —18) 

With x = 0: (0, $, 4); with y = 0: (— 4,0, 2); with 
z = Q0: (7, 17, 0). 

No 43. No 


(a) Circle x? + y? = 4, traced once counterclockwise. If 
0 < t < 2m, circle is traced three times. 

(b) Circle x? + y? = 25, traced once counterclockwise. 

(c) Circle x? + y? = 25, traced once clockwise. 

(d) Ellipse x?/25 + y?/9 = 1, traced once counterclock- 
wise. 


x = (a + a0) cos, y = (a + a0)sin0 


Section 1.3 


1s 
3. 
5. 
Z: 
13. 
17. 
19: 
21. 


a+b = 13, llall = 26; ||bl| = v13 
a+b = —44, |jall = 50; ||bl] = 2/14 
a+b =2, llall = v26; ||bl| = V3 


27/3 9. cos! (v2/v3) 11. 2/2 
3(i+j) 15. 2k 
(2i-j+k)/v6 

V3(i+ j—k) 

Yes, ifa-b=0ora=b. 


25. 


27. 
29. 
33. 


— 
(a) Work = (component of F)| PO | 
— — 
= |F| cos | PO| =F-PO 
(b) 2 
10,000 ft-lb 
cosa = 3/5, cos 6 = 0, cosy = 4/5 
Hint: The diagonals are given by dı = a + b and d} = 


b — a, where a and b determine the adjacent sides of the 
parallelogram. 


Section 1.4 


1. 

5. 

9. 
13 


15. 


21. 


23. 
25. 


35. 


37: 


2 3. —5 
81, —5, 8) 7. 5k 
—6i + 14j + 4k 11. 54/30 


. € is parallel to the plane determined by a and b (or else a 


is parallel to b). 


1002/2 17. 3/34/2 19. 53 
a, a a 
(axb)-c=| bi b bs |, 
ĉi C2 63 
by b by 
(bXec)-a=|] cr C2 C3 
a, a2 a3 


Expand determinants to see they are equal. 
(b) 5 

(a) axb 

(b) 2a x b/|la x bl 


llall 

(e) a x (b x ¢) 
(f) (ax b)xe 
(a) 20/3 ft-lb 
(b) 30V/3 ft-lb 


75/3 ft-lb 39. 4007/3 in/min 


. Rotation about an axis that passes through the rigid body 


is very different from rotation about a parallel axis that 
does not pass through the body. 


Section 1.5 


1 
3 
5. 
7 
l; 


:. X= y2 = 8 
. 5x —4y4+3z=25 


5x — 4y +z = 12 


~x—-yt7z=5 


3x —2y—z=3 


13. 
15: 
17. 
19. 
Zk 
23. 
25. 
29. 
31: 
35. 
37. 


141 + 4,7 = —5t 


x=13t-3,y= - 


—4/3 

x=2s+t—-—1l,y=2-—3s,z=s—S5t+7 
x=2s+5t—1,y = 10t — 3s + 3,z = 4s + 7t — 2 
x = 3s + 3t — 5, y = 10 — 3s — 6t,z = 2s — 2t + 9 
19x — 16y + 7z = 59 

31/34 27. 25/V641 

(a) 0 (b) The lines must intersect. 
[14/2 


x +3y — 5z = 2 + 3/35 


Hint: Consider Example 8 in §1.5 with A as Pı and B 
as P2. 


Section 1.6 


1. 


(a) e; + 2e2 + 363 +--+ nen 
(b) e; — €63 + €4 — €65 H°- + ey_-2 — ên 


. (1, -2, 3, —4, ..., (=D +'n) 
. @ 8,1,11, —1,..., 2n +(-1)"+'2n — 1) 


(b) (—1,7,—1,..., 2n + (—1)"2n — 1) 
(c) (=3, —9, -15,..., —6n + 3) 

(d) /1+9425+---+(2n—1) 

(e) 2—12+30+---+(-1)"t!2n(2n — 1) 


. Hint: Use the triangle inequality. 


. Hint: Square both sides of the equation and write as dot 


products. 


. Hyperplane in R° passing through (1, —2, 0, 4, —1) and 


perpendicular to the vector n = (2, 3, —7, 1, —5) 


. (a) Total cost = 


(200, 250, 300, 375, 450, 500) « (x1, X2, X3, X4, X5, X6) 
(b) {x € RÉ | (200, ..., 500) + (x1, . - . , x6) < 100,000}. 


Budget hyperplane is 200x; + --- + 500x6 = 100,000. 


5 8 8 
“|= 2 Sr 2 


-4 0 
15 —9 
5 0 
. —42 23. —240 


. (a) A lower triangular matrix is an n x n matrix whose 


entries above the main diagonal are zero. 
(b) For an upper triangular matrix, use cofactor expansion 
about the first column or last row. 


. —289 
1/2 —1 1/2 
0 1 0 
0 0 =i 


567 


Answers to Selected Exercises 


I a 1 

7 14 7 

2 5 4 

a 7 4 7 
—1 elt 2 

7 14 7 


39. (32, 46, 19, —35) 


Section 1.7 

1, CA 

5. (2/2, 37/4) 

9. (-1/2, 3/2, -2) 
13. (0, 0, —2) 

17. (2/2, 2/6, 77/4) 


3. (3, 0) 
7. (2cos2, 2 sin 2, 2) 


11. (0. 33/2, 3/2) 
15. (2, 27/3, 13) 


19. (a) z 


0 = 7/2 


(b) 


23. Cartesian: y = 2; cylindrical: r sin@ = 2; surface is a ver- 
tical plane. 


25. Cartesian: x = y = 0; spherical: g = 0 or z; object is the 
z-axis. 


568 Answers to Selected Exercises 


27. 


29. 


31. 


33. 


35. 


37. 


41. 


45. 
47. 


Note: The desired region is that between the two spheres 
shown. 


(a) r = — f (0) is the reflection through the origin of 
r= f(9). 

(b) o = — f (ọ, 0) is the reflection through the origin of 
p = f(g, 9). 

(c) r = 3 f (0) is a threefold magnification of r = f(@). 

(d) 9p =3f(y,0) is a threefold magnification of 
p = f(g, 9). 

i = singcos6 e, + cospcosé ey — sin 0 eg 

j =sing sinf e, + cos ọ sin 0 €y + cos 0 eg 

k = cos g ep — sind €y 

(a) Hint: use part (a) of the previous exercise. 

Hint: use part (a) of the previous exercise. 


True/False Exercises for Chapter 1 


1. 
3: 


False. (The corresponding components must be equal.) 


False. ((—4, —3, —3) is the displacement vector from Pz 
to Pi.) 


5. False. (Velocity is a vector, but speed is a scalar.) 


N 


11. 


T3: 


T5. 
17. 
19. 
21. 
23: 
25. 
27. 
29. 


. False. (The particle will be at (2, —1) + 2(1, 3) = (4, 5).) 
. False. (From the parametric equations, we may read a vec- 


tor parallel to the line to be (—2, 4, 0). This vector is not 
parallel to (—2, 4, 7).) 

False. (The line has symmetric form 

eH 2 z+3 ) 


= — 1 = 
=3 2 

False. (The parametric equations describe a semicircle 

because of the restriction on t.) 


False. (||ka|| = |k||lall.) 
False. (Let a = b = i, and ¢ = j.) 
True 


True. (Check that each point satisfies the equation.) 
False. (The product BA is not defined.) 

False. (det (2A) = 2” det A.) 

False. (The surface with equation p = 4 cos ¢ is a sphere.) 
True 


Miscellaneous Exercises for Chapter 1 


3. 
5; 


29. 


31. 


33. 


x = 63t + 1, y = 1481, z = 847t — 2 


(@) x= 5642, p—31=2 
(b) x = (b2 — a)t + (aq, + bı)/2, 
(a2 + b2)/2 


y= (a — bı + 


. (a) 2x1 + 4x2 — 2x3 + x4 — x5 = 5 


(b) (bi a a)xı pese (bn = An)Xn = ib? = a? + 


+ BE =a) 
. (a) No (b) No 
. (a) 2/3 
(b) x=t,y=1-t,z=t 
. Planes (a) and (d) are parallel; (b) and (e) are the same; 


(c) is perpendicular to (b). 


. The dot product measures the agreement of the answers. 


. Hint: a xb is normal to the plane determined by a 


and b. 
. (a) 5/21 (b) 10 
. (b) 1 


. Hint: Note that to determine x, it suffices to determine ||x|| 


and the angle between a and x. Consider the cases c = 0 
and c Æ 0 separately. 


(a) Hint: Let a and b denote two adjacent sides of the par- 
allelogram. 


(b) Hint: You should simply give a vector equation. 


lon 


wa"= 4 d 


1 
L344 & & 
tfif G 4 
2; 3 4 5 6 Fi 
i,#@ 2a a 4 
(a) H= 3 4 5 6 7 8 : 
i£ ey a a 
4 5 6 7 8 9 
i i-c 2 2 S 
5 6 Fi 8 9 10 
i tw a ok 
6 7 8 9 10 11 
Pe eee E 
ee = gt ago! 
bha — dee : 
4 = 6048000’ >: 266716800000" 
det Hs = l 
or = 786313420339200000 


(b) det Hio © 2.16418 x 1075 
(c) Most likely, AB and BA will not equal Jọ. 


35. 


37. 


39. 
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b 
x =(a+b)cost — bcos (=) 


b 
y= (a+ b)sini —bsin( r) 


b 


GG 
(>) 


x = (a+b)cost —ccos z t 


—b 
Hypotrochoid: x = (a — b) cos t + c cos (¢ r) ‘ 


y = (a — b)sint — c sin 


’ 
’ 


Epitrochoid: 


b 
y= (a+ b)sint = esin ( $ 1) 


(a) y 


(=) 

TIT TTT r r tr r rrr TT 

potbipiitiie tipi titi tdi 
= 


poritirirtiiiilivy 
0.25 0.50 0.75 1.00 


portitittiriitiiys 
-0.75-0.50-0.25 0 


S 
=) 
So 


(b) 
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45. (a) {((7,0,z)|0<r<3,0<06 < 27,0 <z <3} 


if the cylinder is positioned so that the center of the 
bottom is at the origin and its axis is the z-axis. 

(b) Using the same positioning as in part (a), {(p, 9, 0) | 
0< p <3secg,0<9<7/4,0< 0 < 2m} U 
{(0,9,9)|0< p < 3csco, 7/4 < o < x/2,0 < 


0 < 27}. 17. 


Chapter 2 


Section 2.1 


Ts 


on 


(a) Domain = R; range = {y | y > 1} 
(b) No 
(c) No 


. Domain = {(x, y) | y Æ 0}; range = R 
. Domain = R3; range = [0, 00) 
. Domain = {(x, y) | y # 1}; 


range = {(x, y, z) | y # 0, yz = (xy — y — 17 
+7 +1)} 


= u(x, y, Z,t) = xyzt, 


v(x, YZ t) = a = y’, 
u3(x, y, Z, t) =3z +t 


Tī; 


13. 


15; 


Answers to Selected Exercises 


(a) f(x) = —2x/lxl| 


(b) fix, y, z) = —2x// x2 + y? + 22, 
Ala. y, z) = —2y/ y2 + y + zZ, 
Alx, yz) =—2z/ yx +y +z 


(a) fix) = 2x1 — x3 — X4, 
hO = 3x2, 
f) = 2x1 — x3 — x4 
(b) Range = {(y1, y2, y3)lyı = y3} 
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23. 


19. 


l/ 
U 


«i 


Ñ f 
Ny a 
RU 


21. 
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Answers to Selected Exercises 


27. 


31. No 


35. 


BAUMAN 


PAN 


e 
w 


37. (a) 


29. (a) 


574 Answers to Selected Exercises 


39. 
41. Zz 
y 
E E 
tte 
“le x 
43. 


45. 


47. Cone with vertex at (1, —1, —3) 


49. Ellipsoid centered at (—1, 0, 0) 


Section 2.2 


1. Open 3. Neither 
5. Neither 7.2 

9. Does not exist 
13. 0 15. 0 17. 0 


11. Does not exist 


19. —1 


21. 
23. 
25; 


27. 


Limit does not exist. 
Limit does not exist. 


Limit does not exist. 


Limit is 0. 


. Limit does not exist. 
ree 
. Limit does not exist. 
. 0 
. Limit does not exist. 


. Continuous 

. Continuous 

. Not continuous at (0, 0) 

. Continuous 

. Hint: Write f(x) in terms of the components of x. 


Section 2.3 


. f(x,y) = y? + 2xy, yx, y) = Ixy + x? 
. f(x, y) = y cos xy — y sin xy, 


fy(x, y) = x cos xy — x sin xy 


. fa y) = hy e Hy r, 


ha, y) = —4x*y/(x? + yF 
3 


. f(x, y) = —3x°y sinx?y, 


A@y= —x3 sinx? y 


11. 


13. 


15. 


17. 


19. 
2i: 
23: 


25; 
21.: 


29. 


31; 


33. 


37. 


39. 


41 


45. 


47. 


Answers to Selected Exercises 


. f(x, y) = 2 + 2xy cos (x? + y), 


fy, y) = xe” + sin(x? + y) + y cos(x? + y) 
Fy = 1/(y +z), 

F, =-@+2)/O+2), 

F,=(y-*)/Q +z) 


Fy =x/ yx +y +z, 
Fy = y/ yx? + y? +2, 
Laie ty ee 


_ 1 — 2x? — 3xy + y? — 3xz4+ 2? 


F; = > 
(1+ x2 + y? + 22/9? 
7 1 +x? —3xy — 2y? — 3yz + z? 
y (+x y? Hz , 
p Lbs ty? — 3x2 — 3yz — 2z 
= (1+ x? + y? + 27)5/2 
n __ x4 = Bxye + 3x*z? + 3x? 
2o (x? +z? +1) , 

z 
Pec 
7 x? +2241 

2 3 2 

— 9x37 — 

ent? x°Z—ye+y 


` 0P bh iP 
ls 

-1i 

—i+ (2x + 1)j — 2k 


i+j—2k 
i— j+ 4k 
[-4 -3/e —3/e] 


0 2 0 
is 0 27/5 
3.7 Fo 

|: 0 2 = 

0 1 -17 3 
—2 0 

1 <i 
0 2 


(a) The function has continuous partial derivatives. 


(b) z= 3x +8y +3 
z=e(x+y) 


. X5 = —4x1 + 6x2 — 4x3 — 6x4 + 28 
43. 


(a) A(0.1, 0.1) = 1 
(b) (0.1, —0.1) = 1 

(a) A(1.01, 1.95, 2.2) = 21.76 

(b) f(1.01, 1.95, 2.2) = 21.6657 

(a) fx, y) = Gx* + 8x79? + D/O? + 9°), 


575 


fy, y) = =at + 4x3 y H 2x7 y? + ye + yY 


(b) f.(0, 0) = 3, fy(0, 0) = —1 
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49. (a) (8+ 4/2)x — 8y = /2(n — 4) (c) Partial derivatives are rational functions defined on 
(b) R?—hence, continuous—so f is differentiable at 
y (0, 0). 
sh E 57. (a) z = -4 [03r — 96/2)x — (16/20 + 72)y + 
25E (4/2 — 3/3)n? + (16/2 + 9) ] 
2.0E (b) 
= LT] 
1.5 E LOOT 
E OTIO 
ose. ERIE 
0 ArTr ri tir iitisiir tri x 
0 0.5 1.0 15 2.0 
51. (a) x+y =In2—-1 
b 
(b) / 
a r aa aaa 
1E J 
0 E = x 
-1 | 
2E J (c) Partial derivatives are products of polynomials and 
E a sine and cosine functions and, hence, are continuous. 
ee Thus, f is differentiable at (x /3, 77/4). 
59. Df(x) = A. Note that f'(x) = a in the one-variable case. 
53. (a) z= 11x —15 
(b) Section 2.4 
1. Dif +g) 
= [y — sinx + ycos(xy) x+3y?+xcos(xy)] 
3. D€ +2) 


_ [siny +3x?° cosx — x° sinx xcosy 1 
> —6x + yz e+xz yeb+xy 


5. D(fg)=[3x7+y?  2xy], 


D(f/g)=[y? —y*/x? axy + 4y3/x] 
7. DGB) 
= [12x3y + 3x?y° — 12xy? — 2y" 


Y0.50 


2.50 
4 3,4 2,2 6 
(c) Partial derivatives are polynomials—hence contin- B 3x4 + Sx? yt — 18x°y? — 14x35], 
uous—so f is differentiable at (2, 1). (f/8) 
55. (a) 0 _ pe — 6x? — 3x? y4) 3x3 + 6xy? + 5x7y4 — 2] 
` (b) z x(x? _ 2y?) x(2y? = x2? 
i 9. fer = Oxy", iii= 42x3 y° + 6x, 


Fey = fyx = 21x? y® + 6y —7 
11. ics = —ye™ F 2yx er og yx te”, fy = ae, 
Jey = Jyt =e*— x2er/* — yx er/* 
2[sin? 2x — cos 2x(sin? x +2e)] 
(sin? x + 2e”)3 
2e” (2e” — sin? x) 4e” sin 2x 
fy ea a ao Tey = tox = a 
(sinf x + 2e”) (sin? x + 2e”) 
15. fsx =—ysinx, fyy=xcosy, fry = fyx = cosx + 
sin y 


13. fix = 


> 


17. 


19. 


21. 


23. 
25. 


27. 


29. 


f= 20. (f= 2 He. fo = Fa = ee, 


Fez tex 0, Pye hey 0 

tex = 2yz, yy = 2x7, fzz = 2xy, fry = Styx = 2xz+ 
2yz+ zA Sez = fax = 2xy + y? + 2yz, fyz = Fa = 
x? +2xy + 2xz 

fex = beh cosz +a?e®™ siny, fyy = —e“ sin y, fez = 
—e’* cosz, fey = fys =ae™ cosy, fez = fa = 


—be™ sinz, fyz = fry = 0 

a” f/ax" = 3” ye% , a" f/ðy” = 0, forn > 2 

ə” f/ðx" _ (=1 n _ 1)!/x", a" f/ay" = (-1)""!(n = 

1)!/y”, 0” f/ðz” = (—1)"(n — 1)!/z”. All mixed partials 

are zero. 

(a) px and py have degree 16; pxx, Pyy, and pyy all have 
degree 15. 

(b) px and p, have degree 3; pyx has degree 2; pyy has 
undefined degree; p,, has degree 2. 

(c) The degree of 3‘ p/dx;,---dx;, is d —k, where d 


-xh such that, for j = 1,2,...,n, dj is at 
least the number of times x; occurs in the partial 
derivative. If p has no such term, then the degree of 
Ək p/əxi «+» Oxi, is undefined. 


(a) 


(b) 


33. 
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Section 2.5 


1. 


2: 


19. 


df/dt = (18t? + 14t) sin 2t — 6 cos 2t sin? 2t + (1243 + 
1417) cos 2t + 72t + 84 
af/ðs = (3s? + t? + 2st) cos((s + t)(s* + t)), 
af /dt = (s? + 3t? + 2st) cos((s + t)(s? + t7)) 
(a) dP/dt = (36 — 27x)/V2 atm/min 
(c) Approximately (.36 + (9 — .27))//2 
= 19.6477 atm 
—6 in*/min. Decreasing. 
0.766 units/month 
0.2244 cm/min 
Hint: dw/du = 2u(dw/dx) — 2u(dw/dy) from the chain 
rule. 
Hint: dw/dx = (dw/du)((y> — x7 y)/(x? + y?)?) from 
the chain rule. 


. Hint: dw/ax = —1/x?(@w/du + dw/dv) from the chain 


rule. 
a4 744 
Difo g) = | 15(s — 7t) 105(s — 7t y | 


ers 14 —]14e?s 714 
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21. Difog)=[UAts+ne* (1-—s-— tje] Section 2.6 

3s? — : — 2st irecti ivati i 
23. D(fog) = | Ss a 7 p | 1. (a) The directional derivative of f at (x,y,z) in the 

6s>f3 39%? — 25714 negative z-direction. 
(b) VS, y, z). (~k) = —df/dz 
e” (2sint + cost) 3. 85/5 

25. D(fog)= | e'(sin?t+ 2sint cost) 5. (2e — 9)V5/5 

3(sin? t cost + e”) 7. —4/3 e" 

9. (a) fr(0, 0) = f (0,0) = 0 
27. D(fo g) (b) Dy f (0, 0) = v|w| for all unit vectors v = vi + wj. 
t+ u s+u s+t (c) 
~~ 35223 e tuzes? 35342 — suze’ — Astuet” 


» [2 g] 


a? a  sinņ?ð ð 2sinðcoső 3? 
31. — = cos? 0 
(a) ax? SSS aðr? = r ðr r drag 


Pe fa] , Sino a? 
r2 30 r2 80? 


ə? > 8? cos?6 ð  2sinðcosð ə? 11. (a) (—2i + j)/ v5 direction 
— = sin’ 0 + + (b) (i + 2j)/ v5 direction 
dy? ər? r ðr r ard0 J 
2sinð cos ð 4 9086 a? 13. 
r2 30 r? 30? 

f] ð coso 9 

33. @) Ż B aa 

(a) ar are p 0g 


35. dy/dx = (2xy’ — y cos xy)/(x cos xy — 7x? y® + e”) 


37. Oz/dx = 3x?z? /(yz? sinz + sin y — x°z’), z/dy = 
(z? cosz + z cos y)/(yz” sin z + sin y — x3z7) 


N 


a a / 
39. (a) G Pu =l ( n =h; 2) 3 Approximate path 
yz dy dz x,y vh: 
da wa ( 2") =F = dy s 
Ox dy x 


(b) fi = i) (=) 15. Travel along y? = 27x (toward the origin) 
ax /, ax Jy, NXS, 17. y-z=1 19. x-—2y—2z=2 
dw dy dw az 21. The tangent plane has equation x — mz = 2. 
ü ( By is (sz), X ( az )., (5), 23. (3, —3,-3) and (—3, 3, 3) 


27. (a) 3x + 12y+2z+10=0 
SE Cd is a ae plane at (0, 0, 0). 
as 29. Tangent line is 5x — 344 y = —1. 
1 (=) =xw — 2z; 31. Parametric equations: x = 5t + 5, y = 4t — 4. Cartesian 
A equation: 4x — 5y = 40. 
(2) =ni = zE xy =x 33. Parametric equations: x = 14t + 2, y = t — 1. Cartesian 
xw xw — 3y?z equation: x — 14y = 16. 


az 


35. x =-2t-—l,y=t 
37. xs —x, =T 
39. x1 + x2 +--+ Xn-1 — Xn = Sn 
41. (a) Near all points of S such that x 4 0. At such points 
f(x, y) = In(1 — sinxy — x3 y)/x. 
(b) All points (0, y, z) (i.e., the yz-plane). 
(c) 


l,z=t-1 


x 
43. (b) Yes, y = x77. 
y 
1+ 
0.5 F 
x 
-0.5 0 0.5 


(c) The implicit function theorem suggests that we need 
not be able to solve for y in terms of x. Note that we 
can in this case, but that the function fails to be of 
class C! at x = 0. 


45. (a) G(—1, 1, 1) = F(-2, 1) =0 
(b) Hint: Use the chain rule to find G,(—1,1,1)= 
304 0. 
47. (a) ACI, 0, —1, 1,2) = —120 $ 0; apply the general im- 
plicit function theorem. 


ayy 7 dy2 1 dy3 
b 1,0)= 1,0) = —5, 1,0) = —1 
O a=- 50,0 = 4, 50.0) 


5? 


49. (a) Anywhere except where p = Oorz. 
(b) We can solve anywhere except along the z-axis, which 
is where 0 can have any value. 
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Section 2.7 


1. (1.302942538, —0.902880451) 
3. (a) Estimate intersection points near (1,1/2) and 


(—1/2, —3/4). 


(b) (1.103931711, 0.441965716) and (—0.518214315, 


—0.657923613) 
5. (a) Xo = (—1, 1) leads to (—1.2649111, 1.54919334). 


(b) xo = (1, —1) leads to (1.26491106, —1.5491933), 
while x9 =(—1,-—1) leads to (—1.2649111, 
—1.5491933). 

(c) It appears that an initial vector leads to an intersection 
point that lies in the same quadrant. 

9. (—0.9070154, —0.9070154) 


True/False Exercises for Chapter 2 


1. False 

3. False. (The range also requires v Æ 0.) 

5. True 

7. False. (The graph of x? + y? + z? = 0 isa single point.) 
9. False 

11. False. (lim¢.,)+(0,0) fŒ, y) = 0 F 2.) 

13. False 

15. False. (V f(x, y, z) = (0, cos y, 0).) 

17. True 

19. False. (The partial derivatives must be continuous.) 

21. False. (fiy Æ fyx-) 

23. True. (Write the chain rule for this situation.) 

25. False. (The correct equation is x + y + 2z = 2.) 

27. True 29. False 
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Miscellaneous Exercises for Chapter 2 


W (s, 5) 
1.(a) AE) = —x2, RA) = x1 — 3, A) = x2 10 
(b) Domain is R?; range consists of all vectors in R? of the E psp es a Pagat jeee ag 
form ai+bj—ak. E 20 40 60 80 100 120 
3. (a) Domain: {(x,y)|x > 0, y > 0}U{@,y)|x <0, -10E 
y < 0} E w. 
Range: [0, oo) —20 - 2 
(b) Domain is closed. —30 E 
5 -40 
fa,y= Graph | Level curves (b) W (10, 2) 
1 H 
—— D d p 
x? +y +1 20} 
sin yx? + y? B E P 
(By? — 2x2" -2 A b 40 
y? — 3x?y E c 
x2y2e -3 F a 5 
yey c g 
7.0 
11. (a) 15°F W 30, 9 
(b) 5°F 
ol L ' fig 
13.1.3 —40 —20 20 40 


15. (a) 9.57°F; 16.87°F; 8.39 mph 
(b) Effect of windspeed is greater in the Siple formula 
than in the table. wW, 


(c) Ift < 91.4°F or s > 4 mph, the Siple formula gives 
windchill values that are higher than the air tempera- 
ture, which is unrealistic. 


17. (a) 


W (s, 40) 


40 
35 
30 
25 
20 
15 
10 


j j 1 | į ji 1 | S 
20 40 60 80 100 120 


19. 


2k 


23. 


25. 
27. 
29. 


31. 
35. 


Hint: Compare the normal vector to the tangent plane with 
— 
the vector OP. 


(a) 3x —4y —5z=0 
(b) ax + by —cz=0 
(a) 2x — 8y -z =5 
(b) x=1,y=t—1l,z=5-— 8t 
12, 201.4 units/month 
(a) dw/dp = 2p, Ow/dg = ðw/30 = 0 
(a) 0z/dr = e’(cosO(dz/dx) + sinO(dz/dy)), 
dz/00 = e" (— sin 0(9z/3x) + cos 6(dz/dy)); 
dz/dx = e" (cos 0(0z/dr) — sin 0(9z/30)), 
dz/dy = e-"(sin@(dz/dr) + cos 0(əz/30)) 
(b) Hint: 8?z/3x? = e7” (cos? 6(87z/dr7) 
+ (sin? 0 — cos? 0)(9z/dr) 
+2 cosð sinð(ðz/30) 
— 2 cos sin 0(3?z/3r3ð0) 
+ sin? 0(8?z/30”)), 
32?z/ə3y? = e7” (sin? 6(92z/dr2) 
+ (cos? 8 — sin? 0)(3z/3r) 
— 2 sin 0 cos 0(0z/00) 
+2 sin cos 0(d2z/draé@) 
+ cos? 0(8*z/d07)) 


u“ [uD +u Inu + u” (In u) | 


(a) z= f(x,y), where f(x, y) = (x? — 3xy?)/(X? + 
y’) if (x, y) # (0,0); f(x, y) = 0 if (x, y) = (0, 0) 

(b) Yes 

(c) af /ax = (x4 + 6x7y? — 3y*)/@? + 9’), 
af/dy = —8x? y/(x? + y’) (if (x, y) # (0, 0); 
f.(0, 0) = 1, f,(0, 0) = 0 

(d) Du f(0, 0) = (Af/dr)|-=0 = cos 30 

(e) f\(0,0)=0 from (c) but along y=x, 
Sy(x, x) = —2, which does not have a limit of 0. 
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37. Homogeneous of degree 3 
39. Homogeneous of degree 3 
41. Homogeneous of degree 0 


Chapter 3 


Section 3.1 
1. 


7. v(t) = 3i+ 2j, speed = V13, a(t) = 0 
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9. v(t) = (sint + t cost, cost — t sint, 2t), 
speed = v57 + I, 
a(t) = (2cost — t sint, —2 sint — t cost, 2) 
11. (a) 


13. (a) 


15. Wt) =ti+(3t+ Dj 
17. I(t) = (4t — 4, 12t — 16, 80t — 128) 
19; (a) y 


(b) (£) = (t, 10t — 15) 


(c) y=x?-2x41 

(d) y — 5 = 10(x — 2). (Let x = t to verify agreement.) 
21. 117.1875 ft 
23. 25.09° or 64.91° 
25. No 
27. Write out both sides in terms of component functions. 
29. Hint: Differentiate Ix’. 
31. (a) ©) 


33. (a) x(1) = x(—1) = (1, 0) 
(b) 2/2 


Section 3.2 


TI; 
13: 


15. 
17: 


19. 


21: 
23. 


ON Mw = 


. 3/13 

. 6/3 —2 
. 27 

: B+In4 


6a 


Vm +1 |x — xıl 
(a) 


= 


T T T ji 
05 1 15 2 25 3 


(b) The path consists of three C! pieces: one for —2 < t < 
0, another for 0 < t < 1, and another for 1 < t < 2. 


(c) 4/2 
Hint: Begin with x = r cos 0, y = r sin ð. 
1 
T = ——(—5 sin 3t i + 2j + 5 cos 3t k), 
Work j , 
N = — cos 3t i — sin 3t k, 
1 
B= ——(—2 sin 3t i — 5j + 2 cos 3t k), 
755 j 
2 
k= T= 3 
T= /2(1,iViF1,—}v1 Zf), 
N=2/2(0, 4/1 =r, iyt +1), 
B= -5 (1,-vi+1, vI =t), 
k = J2/(6V1—P), t =1/ (3v1 = r) 
(b) k = |sin x| 
ab 
(a) 


(a? sin? t + b? cos? t)3/? 


(b) 


25. (a) 


(b) 
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LS errr tT 


1.5 Fbtrilisviitisii tins 


-2 =1 0 


= 
= 
mr 


0.5 


o 


1 


w Lilt 


Som 


1 2 3 4 


3 
4aJ/2+2cost 


y 


W 


6 


W 


=2 -1 0 1 


2 


Ww 


0 tility 


t 
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27. dung = 4t/V4t? + 1, dnom = 2/ V4 + 1 te 
29. dtang = V5e', Gnorm = 2V5e 

31. Gtang = 4t//2 + 482, Qnorm = 2//1 + 2? 

33. (b) dung = 4t/V4t? + 10, dnorm = 2V 10/4 + 10 


35. Hint: Write v and a in terms of T and N and use the 
Frenet—Serret formulas. 


37. Hint: Use formula (17) and Exercise 35. 
39. Hint: Recall that ||x — xo]? = (x — xo) « © — Xo). 
41. Hint: Calculate T and show that (x(t) — (1, 0, 0))- T = 0. 


Section 3.3 


1. y 9. 
PPS OF = ba ANY 
ee ee ee N 
LFF? PPR RASS 
ff SP ee a aN 
ee ee TE a A 
BP A BS at Sr Re. A 
af gh —— x 
a ee oe ee ee ee ae ee 
Shas are me gan Ge 
Roe SS See ee ae ae a 
KARE TS eee SS 
a ee A A 
NNNNSS FAZIS 
3 y 11 
SPOETTNNNSSASS 
ae 2 E BO OOS. 
PFEPFEVN SRVRAS 
Pe ae ae ae BN Roy, Oe 
BP a OS SE ER Re Te, ae 
x 
ig ey ee E) Ee ae eer 
eR Re EZER 
SLUNG g PEL FL 
eR Re ie ae 
ISA GL PLES 
13. y 
5. y EER NEE 
Min, Me, X 
SASSA AA ane T 
SN A | od A Ae Ss de ce ee ee en ea 
Sere N | eA ee ee ee eee 
SMES | oe eee paa ee A E 
A NCS. | ot ot et AA Oe aaayy aw ees 
SANS | SAAT a 
> ` — x Pa ae ee le a aie ie 
ee ee Pere ere 
eee ee [4 De eee a 
“Sette S| a a ee Soe oe 
a NN (feta SEEREN RASS eeoe 
SN NE | Ae Le ee ee 
SO NN Ld A oe 


= 


15; y 


DAER TEET) 
PRP Aa 
Pp paar 3s 
PP aad ewe 
p par ¥ 3 
pay yya 


meer 


-a 
owes 


anneer 


NAAR me ee wee 


mW VV VN a4 


eaassang 
meagan 
wea ageer 


a Sa he a Sy | Re ee 


Zi, (= e') 
2—t 
23. (a) F = V f, where f(x, y, z) = 3x — 2y + z. 
(b) Equipotential surfaces are parallel planes with equa- 
tion 3x —2y+z=c (i.e. planes with normal 
@G, —2, 1)): 
25. Hint: Use the chain rule and the facts that V f = F and x 
is a flow line of F. 
31. Hint: Differentiate the defining differential equation for 
the flow with respect to x = (x1, X2,..-,Xn). 


Answers to Selected Exercises 585 


Section 3.4 


= =í 


21. 
23. 
25. 
27. 


29. 
33. 


Oo a ee 


2x + 2y 

3 

2x1 + 4x2 + 6x3 +--+ + 2NXn 

2xzi— 2yzj— e” k 

0 

(x? — xye™*)i + (y? — 2xy)j + (ze = 2yz)k 

(a) div F > 0 on all of R? 

(b) div F < 0 on all of R? 

(c) div F>0 on {(x,y)|x>0}, div F<0 on 
{(x, y) | x < 0} 

(d) div F>0 on {(x,y)| y <0}, dv F<0O on 
{(x, y) | y > 0} 

Write out in terms of the component functions of F and G. 

Write out in terms of the component functions of F. 

Write out in terms of the component functions of F and G. 

First use the chain rule to replace occurrences of the 

Cartesian differential operators in V by combinations 

of spherical differential operators. Then compute V f. 

Finally, replace i, j, and k by appropriate combinations 

of ep, €y, and eg. (See also §1.7.) 

Write out the components of f V f. 


(1/V3, 4//3, -1/V3) 


True/False Exercises for Chapter 3 


. True 


True 


False. (There should be a negative sign in the second term 
on the right.) 


. True 


False 


. True 

. True 

. False. (It’s a scalar field.) 

. True 

. False. (It’s a meaningless expression.) 
. True. (Check that F(x(t)) = x’(t).) 

. False. (V x F Æ 0.) 

. False. (Consider F = yi + xj.) 

. True 

. False. (V (V x F) 40.) 


Miscellaneous Exercises for Chapter 3 


1. 


3. 


(a) D (b) F (c) A 
AB (ec) C (ff) E 


Hint: Differentiate (ds /dt)? = l xro. 


11. 


13. 


15: 
17. 


19. 


23. 


25; 
27. 


Answers to Selected Exercises 


y 

1.0¢- 

0.8 
E w=5 

0.6 |- w=2 w=0 

ae ee 

0.2 

oa T a A A a S R A S tii x 
0 0.2 0.4 0.6 0.8 1.0 
. (a) Hint: Calculate x(0) and x(1). 


(b) Hint: Show that 


1 1 Xy +x. yity3 
x(5) = 7g y) + pg (CS, 4). 


@) gv i — 2x2 + 3 + O1 — 292 + yY 
b) ata G1 — 2x2 + x3? + O — 292 yY 


Fw) 

(a) Hint: Find where x'(t) = (0, 0). 

(b) Hint: The tangent line at x(fo) is given by I(s) = 
X(fo) + 5x’(to). Find the y-intercept of this line. 

Hint: Begin with x(0) = (f(@) cos@, f(@) sin@). 

(a) y(t) = (a(cost + t sint), a(sint — t cost )) 

(b) y 


4 ot 


iania il ga a a t i a 
tag 


4a N odlad 
5 4 3 2 1 0 1 


N 


(a) Show that || x(t) — y(t)|| = s(t). 

(b) The vector difference x(t) — y(t) is a tangent vector 
(to x) s(t) units long. 

e(t) = (a(t + sint), a(cost — 1)), which is a another type 

of cycloid. 

Hint: Use the Frenet—Serret formulas. 

(a) Show that (x’(s))? + (y’(s))? = 1 by means of the 
fundamental theorem of calculus. 


29. 


31. (a) 
35. 


b) « = |g’(s)| 

(c) Set g’(s) = k(s). Find g by antidifferentiation and set 
x(s) = fp cos g(t)dt, y(s) = fj sin g(t)dt. 

(d) x(s) = fj cos(t*/2)dt, y(s) = fy sin(t?/2) dt 


(e) T 


15 Tgp ae a 


Litt 


1.5 Eliri 
1 0 3 
(a) Calculate B and remember that it is a unit vector. 
(b) B is constant, so B’ = 0. Next use the Frenet—Serret 
formula. 
2 2 2 
+ h4 /4r 
ath in”) 8277 ft 
a 


N’ = —xT + tB. Argue that N’ 4 0. 


37. (a) 2V2n (b) F= V2(yi-2xj + yk) 
41. Hint: Use Proposition 1.4. 

43. No 

Chapter 4 


Section 4.1 


T; 
3: 


N 


. PJ=3+ (x 
. ps(x) = 1 — (x — 2/2)? /2 + (x — z /2)*/24 
. pœ, y) = $ — Ax — 1)/9 + 2 + 1)/9, 


. pi(x, y, z) = 1 +x + 8y + 4z, 


pa(x) = 1 + 2x + 2x? + 4x? /3 + 2x4/3 
pa(x) = 1 — 2(x — 1) 4+.3(x — 1)? — 4x — 1 + 
5(x — 1) 


9)/6 — (x — 997/216 + (x — 9} /3888 


p(x, y) = $ — Ax — 1)/9 + 20 + 1)/9 + œ — 17/27 
— 8(x — I(y + 1)/27 + (y + 1}?/27 


- pitt, y) = —1— 2x, 


pox, y) = —1 — 2x — 2x7 + Wy — x} /2 
pox, y, z) = xy — 
3y? + 2xz 


. Pix, Y,Z) = pox, y,z) =0 


v6 _ v2 
4 4 

ie A 
4 


21. 


23. 


25. 


27. 


29. 
31. 
33. 
35. 
37. 
39. 
41. 


43. 


6 2 0 
2 0 =2 
0 =2 12 


px, y)=1+[0 of] 


z—2 
0 3 0 x 
1 
+3[x y z—2 | 3 8 2 y 
02 0 z—2 
(a) Df0)=[1 2 a], 
i 22 3 n 
2 6 2n 
Hf)=|3 6 9 3n 
n 2n 3n --- n? 
(b) pi&i: Xn) = Lxi F 2x +--+ +nx, 


Pline ta = lai 5 ae ijXiXj 
(a) 2— z +3xy +x°y — xz? + 2y3 
(b) -4 -4x — 1) +11 +1) —2 4 $[4@ 1 + 
16(x — 1)(y + 1) — 12(y + 1}? — 227] + 
z[18(x — 1)? + 24(x — 1)7(y + 1) — 6(@ — Dz? + 
12(y + 1)3] 
2x dx + 6y dy — 6z?dz 


e* cos ydx + (e” sinz — e* sin y)dy + e” coszdz 


(a) 388.08 (b) 0.24625 (c) 1.1 
The (1, 1)-entry (upper left) 

2.4 cm 

0.0068 m 


(a) pox, y)=1- 51? +0 - 5] 
(b) Accurate to at least 0.0360 


(a) pox, y) = 5 — y—2x(y—- 5) 
(b) Accurate to at least 0.0311 


Section 4.2 


1. 


(a) (2, 3) 
b) Af = =k? — k? 
(c) There is a local maximum at (2, 3). 


23; 


25. 


27. 


. Local maximum at ($, 4) 
5 


. Local minimum at ( 


. Saddle point at (0, 5) local minimum at (0, — 


. Local maximum at (-3. t) 


. Saddle point at (0, 6, —3) 
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2 


215); saddle point at (3, 1) 


. Minimum at (4 5) 


72 


. Saddle point at (0, 0); minimum at (0, 2) 


1 
+) 


3 


. Local minimum at (0, 0, 0) 
. Saddle point at (—1, 4, 5) 


72°22 


. (a) (0, —2) and (0, 3) 


(b) Local maximum at (0, —2); 
local minimum at (0, 3) 


(a) Minimum if a,b > 0; maximum if a, b < 0; saddle 
point otherwise 

(b) Minimum if a, b, c > 0; maximum if a, b, c < 0; 
saddle point otherwise 

(c) Minimum if aj,...,d, > 0; maximum if q,..., 
an < 0; saddle point otherwise 


Saddle points at (0, 0), (4 3,0); local maxima at 


(aA) C A) 

Saddle points at (0, 0, 0, 0), (-V2, 2V2, 1, —v/2), 
(v2, 2/2, —1, — V2) (-V2, —2V2, —1, V2), 
(V2, —2V2, 1, /2) 


36 48 12 
 ( ) 


B: B-B 


. 1100 units of model X and 700 units of model Y 
. Maximum of 8 at (0, 0, 2); 


na 191 32 8 
minimum of —= at (-¥, 3, >) 


. Maximum of 1 at (7/2, 0), (7/2, 277), (37/2, 1); 


minimum of —1 at (37/2, 0), (37/2, 277), (7/2, 7) 


. Maximum of 11 at (2, 0); minimum of 3 at (5. 1) 


. Maximum of e° at (0, 1, —2); minimum ofe at all (x, y, z) 


such that x? + y? —2y+2?+42=0 


. (b) Maximum 
. (b) Neither 
. (b) Maximum 


. (a) Local maximum at (0, 0, 0) 


(b) (0, 0, 0) = e? is a global maximum. 


. Global minimum of 2+1n2 at (2,5). No global 


maximum. 


- (a) {A Vy € R} U {(, 2)|x € R} 


(b) Maxima of 3 along critical points in (a). 


. (b) Critical points are (2, 1) and (0, —1). 
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(c) 35. (a) (V6, V6) and (—V6, —V/6) 
39. Nearest points are (/5, —V/5) and (-JV5, 5); 
farthest points are (5, 5) and (—5, —5). 
41. (a) Critical point at (1, 0) 
(b) There is a minimum of 0 at (0, 0) and a maximum of 
1 at (1, 0). 


ose 
QO 


2 


Section 4.3 


1. (a) Minimize f(x, y) =x? + y? + (2x —3y—4) to 
find that the closest point is (3 = Z). 


PP? 
(b) Minimize f(x, y, z) = x? + y? + 2? subject to 2x — 
3y—-z=4. 
3. (V2, V2) and (—-/2, —/2) 
5. (1, 4, 2), (3, 0, 0), (0, 2, 0) and (0, 0, 6) 
7. (th te) 
9. (3,353) 
11. (£45, 4, v2) and (445, -45, -v2) 
13. (a) (+3. 1), (0+4) 
(b) (+ Li) give maxima; 0+4) give minima. (c) Vg = Oat (0, 0) 
2 43. (a) Hint: Check thatdL/0l; = c; — g;(x)fori = 1,..., k 
1s. (94 2), YE 4 Be) ] 
17. (£1, 0, 0), (0, £1, 0), (0, 0, £1), (3, —3, 4), aL af 128 
22 _1 = i 
(-3, “gi —4), Ox; Ox; i=l Ox; 
forj =1,...,n. 
(iy. -b #72) (V4. -b V9) ee i 
19 (2 L 1-v2 2), ( 1 i iiv 1+2) Section 4.4 
NV? y2? 2? 2 PU 2? 2? 2? 2 


1. 5x —Ty+14=0 


3. (a) D(a, b) = 710i — @/xi + b)? 
(b) Minimize D with respect to a and b. 


21. The numbers are 6, 6, 6. 
23. Maximum value: 6. Minimum value: 0. 


25. Height should be equal i diameter. 5. Hint: Let D(a, b, c) = Y? O; — (ax? + bx; + ¢)}. 
27. Locate at either (—2, 2, 1) or (—2, —2, 1). ; . PE . 
. 7. (b) There is a single, stable equilibrium point at 
29. Largest sphere has equation x? + y? + z? = 2. (-, -1 ; 
9 4 5 
31. (5,2, 3) 9. Single equilibrium point at (—1, 3, 3). There are no stable 


33. Highest point is (—1, —1, 2); lowest point is (4, 5, 5). equilibria. 


11. Produce 50,000 each of both the standard and executive 
models and 100,000 deluxe models. 


13. Irrigate only; Purchase 3333.3 gal of water. 


15. (a) Invest $120,000 for capital equipment and $240,000 
for labor. 
(b) Hint: Note that L/K = 2. 


True/False Exercises for Chapter 4 


1. True 

3. True 

5. True 

7. False. (f is most sensitive to changes in y.) 

9. False 

11. True 

13. False. (Consider the function f(x, y) = x? + y?.) 

15. True 

17. False. (The point is not a critical point of the function.) 

19. True 

21. False. (The critical point is a saddle point.) 

23. False. (Extrema may also occur at points where g = c and 
Vg =0.) 

25. False. (You will have to solve a system of 7 equations in 
7 unknowns.) 

27. True 


29. False. (The equilibrium points are the critical points of the 
potential function.) 


Miscellaneous Exercises for Chapter 4 


ls ro = 2ho 

3. Price the Mocha at $2.70 per pound and the Kona at $5 
per pound. 

5. Maximum value of 4 at (1, —/3, 0). Minimum value of 
—4 at (—1, v3, 0). 

7. (e) 


(E 0). (= 
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(b) 


7) 
Uf 


r. 


AMAHA 
VA 
PAH 


PEE 
1.0 Baas 
ZSSS 
SS ie) 
0.5 ZSSS: 
S m E +A 
z 0 Bae: 1 
EE 
-0.5 ASAS 
0 
-1.0 y 


(c) Maxima at (+ 45.0), ( 
1 
at (+: 
(0, 0, £1) 
11. 1/(@ +a} +--+ a2) 
13. Dimensions are 4 (x-direction) by 2/2 (y-direction) by 
2 (z-direction). 
15. 1 
17. a/3 by b/3 by c/3 
19. 3./5/8 


21. Hint: Minimize D? = (x — xo) + (y — yo)’, where 
(x, y) denotes a point on the line ax + by = d. 


J 45,0); minima 


0); saddle points at 


m0) (“yee 


23. (a) Hint: Show that the maximum value occurs when 
xX =y =z =a). 
(b) Since f(x, y, z) = x?y?z? is maximized when x 
y? = z? = a? /3, we must have x?y?z? < (a? /3P = 
(x? + y? 2NF. 
(c) Hint: Since x1, x2,...,%, are assumed to be posi- 
tive, we can write x; = y? fori = 1,...,n. Maximize 


2— 


Ft» Yas +++, Yn) = VIVE “++ Yn Subject to yf + y3 + 
sty =a z 
Gi N ae 
: : = 5 


(at+c)+/(a— c}? + 4b? 
5 . 


(b) Rewrite as 41, à2 = 


Chapter 5 


Section 5.1 


40 
‘A; 3 


3. 6(e — 1) 
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bak —2e? +e+2In2—- 2 

7. (a) Volume = f? fo (x? + y? + 2)dy dx = 26 
(b) Volume = ff [?,(x? +y? + 2)dx dy 

9. i, fiers + yt sinwx)dy dx = 2 + 66/52 


11. The iterated integral gives the volume of the region 
bounded by the graph of z = 16 — x? — y?, the xy-plane, 
and the planes x = 1, x = 3, y 2, y = 2. The value 
of the integral is a, 


13. The iterated integral gives the volume of the region 
bounded by the graph of z = 4 — x?, the xy-plane, and 
the planes x 2, x = 2, y = 0, y = 5. The value of the 
integral is 1, 

15. The iterated integral gives the volume of the region 
bounded by the graph of z = 5 — |y|, the xy-plane, and 
the planes x l,x=2,y 5, y = 5. The value of 
the integral is 75. 


Section 5.2 


1. 0 
3. (a) Check that f2, fo" * x3 dydx =0. 
(b) Hint: The region D is symmetric about the y-axis and 
x? is an odd function. 


5. 4 


(4, 2) 


9. 1—cos 16 
11. 32/2 y 
1 
VOS 
2 
-1 
13. 0 y 
y=e* (1, e) 
PA 
1 
D 
t t t x 
-1 2 3 4 
-1 
z 
y=-e* (1, -e) 
4 
15. 3 
99 
17. 55 
19. $(e- 1) 
21, 128 


5 

23. Hint: Write f fẹ cf dA and ffp f dA as limits of appro- 
priate Riemann sums. 

25. Hint: Use the fact that Daa a;l < J; lal. 


1 
27, 


29. Area is mab. 


31. 7 


33. 18 


25, 


37. 11,664 


39. 1$ 


Answers to Selected Exercises 591 


41. (a) i f(x, y)dy = 2 regardless of whether x is rational T to La e” dydx + ig Ln e" dydx = $ + @/2—e 
or irrational. 
(b) Value is 2. 
(c) 2 
(d) Converges to 3. 
(e) The Riemann sum has no uniquely determined limit. 


Section 5.3 


1. (a) 4 
(b) 


(2, 4) 


© Jo {v3 2x + Idx dy =4 


4 2-y/2 ie 
3. fo Jo  y dx dy = = k 
13. 86 

15. (1 — sin 1)/2 

4404) 17. (27 — 1)/6 


19. (a) Ifyour computer algebra system can provide a simple 
answer, it should be 1a — cos 2). 
(b) The integral with respect to y requires two applica- 
y=4-2x tions of integration by parts. 
(c) It should take the computer only a fraction of a second 
to find that fi >” y? cos (xy) dx dy = 1(1 — cos 2), 
much faster than the evaluation in part (a). 
(2, 0) x 21. (a) Itis quite possible that the computer will be unable to 
make the evaluation. 


(b) With order reversed, the computer easily finds 


3 px? 
5: i i (x + y)dy dx = gi je jm ecs x dydx =e-—l. 


(3, 9) 
Section 5.4 


9. Volume = |“ S PE 


ite 2 
13. 0 
15. 4 


23. 
25. 


27. 


29. 


Answers to Selected Exercises 


V2 7/2 
h ae -h~ ia y, z)dzdydx 


= hdo mu - f(x, y, z)dy dz dx 
shh feig £0 y, Day dx dz 
= fin fp? fF. y, ddx dzdy 
= fi SAE Ie fe, y, z)dx dy dz 


WS, R Fœ, y, 2)dzdx dy 
= fo R ff fŒ, y, 2dydzdx 
= te Li f(x, y,z)dy dx dz 
= fo fe fy fy, zdxdzdy 
= i, ER f(x, y,z)dx dy dz 


(a) Bottom surface is z = x? + 3y?, top surface is z = 
4— yia a in iy-plane is x?/4 + y2? < 1, y > 0. 


(b) pee plow nay? + y3)dzdx dy 

(c) hh FI nee x? + y?)dx dzdy 

OE yn wee a 
A Me VR Tort + dx dy dz 


E S ae ee 8 3 + y3) dy dzdx + 


I er Oe + y*)dy dzdx 


Section 5.5 


1. (a) T(u, v) = 


3: 


3 0 u 
0 -1 v 
(b) D = [0, 3] x [-1, 0] 


D is the parallelogram with vertices (0, 0), (11, 2), (4, 3), 
(15,5): 


. T takes W* onto the parallelepiped with vertices 


(0, 0, 0), (3, 1, 5), (—1, —1, 3), (0, 2, —1), (2, 0, 8), 
(3, 3, 4), (—1, 1, 2), and (2, 2, 7). 

(a) D = {@, y, z) |x? +y? +z <1} 

(b) D= {x,y,z | x? +y? +27 < 1, x,y,z > 0} 


(c) F yz) l4 <x? +y +z <l, x,y,z > 0} 
E udve” . } du dv = 8(e4 — 1)/3 
; Ke — 1/e) 
. 3m 
. 4867/5 
. 3In(vV2 + 1) 
. (16 — 3x)/12 


. xa?/4ifn is odd, 


ma? /2 ifn is even. 


. 2+7/4 
. (x sin 1)/3 

. 4n(V2 + 1)+ v2- 1) 
. m(e*—e+1) 

. 487 

; Z ja? — (a 
. In((1— a?e” + (b? — 1)e”) 
. 85057 /32 

. 4/2 n(5//5 — 8)/3 

. 6567/5 


_ bY 


Section 5.6 


1; 


3 
5; 


. = 
. (a) (Z, y, 2 = 


. &, yz) = 
. 3a/8 


(a) 80 cases 

(b) $1.60 

e? —2e+1 

(a) c, where c is the constant of proportionality. 
b) {y z) 1x? +y +z = 1} 

30 — 37 


` 30— 5r 
. 90 seconds 
. If the plate is located at {(x, y) | x? + y? < a’, y > 0}, 


then (x, y) = (0, 4a /37). 


-& N= (7.7) 
. Œ 5) = ((WW3 + 82)/BV3 + 4r), 15/(V3 + 47) 


(21/20, 0) 

(5,0, 2) 
(b) (&. 5,2) = (8.0, 2) 
(0, 0, - 7) 


(a) I, =L=; =4 
(b) Tz Sly = Tz a fs 


. (a) L = 65617 /4; r; = (3V3)/(2V2) 


(b) I, = 87487 /35; r; = BV3)/V7 


29. 1496/135 
31. 1167/3 
33. V = -3GMm(b? — a*)/(2(b> — a3)) 


Section 5.7 
1. (a) Th,3 = —0.621375 
(b) Exact value is —0.619. 


3. (a) Ta3 = 0.19825 
(b) Exact value is 0.197402. 


5. (a) T3 = 0.336123 
(b) Exact value is 0.331642. 


7. (a) S22 = 0.00010125 
(b) Exact value is 0.0000972. 


9. (a) S22 = 0.331871 
(b) Exact value is 0.331642. 


11. (a) S22 = 0.414325 
(b) Exact value is 0.414214. 


13. (a) fy fy V4x2 + 36y? + Ldy dx 
(b) T44 = 3.52366 
15. S22 is more accurate than T4 4. 


17. (a) n must be at least 19. 
(b) n must be at least 1. 


19. (a) S2,2 = —0.00390625 
(b) Exact value is —1/256. 
(c) The answers are the same because E22 = 0 by Theo- 
rem 7.4. 


21. T33 = 0.190978 
23. T33 = 0.412888 
25. T33 = 0.724061 


True/False Exercises for Chapter 5 
1. False. (Not all rectangles must have sides parallel to the 
coordinate axes.) 
3. True 
5. False 
7. False 
9. True 
11. True 
13. False. (The value of the integral is 3.) 
15. True 
17. True. (The inner integral is zero because of symmetry.) 
19. True 
21. False. (The integrals are opposites of one another.) 


23. False. (A factor of r should appear in the integrand.) 
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25. False. (A factor of p is missing in the integrand.) 
27. True 
29. True 


Miscellaneous Exercises for Chapter 5 
1. 727 
3. (a) fo PE za 3dzdydx 
and É, pve 9—x? 


2 3 dz dx dy are two 


2x?+y? 
possibilities. 
) 81/3 2/4 
J1—x2 9—x2—y2 
5. (a) ie J ah dzdy dx 
(b) fig i R o° sing dp do d0 + 
d pe 1/3 > e-sing dp dg dé. Value is 


a(S) 


7. Bh a foo” rdr dO dz =a 
9. (sin 1 + sin 2)/6 
bf 1x? /a 

1. @ [LY 
(b) E* ={G,y) (P+ = 1} 
(c) Area = fy” fy abr dr dO 

13. Area is m. 

15. —21n4 

17. (In2)(tan~! 4 — 7) 


dy dx 


(ara fa =x ay?  pafar—x2-y2=z 
19. @) oo ae I J 
(b) x?at/2 
(c) Five-dimensional ball has volume 877a°/15; six- 
dimensional ball has volume 23a°/6. The pattern is 
not very clear from this information. 


25. Within a disk of radius //5a /3 about the center, where a 
is the radius of the hemisphere. 


27. (a) 4/1 =€ — /e(/1 — 5 — V8) 
(b) 4 
29. Integral does not converge. 
31. —47/9 
33. Converges when p > 1 and q > 1; value is 
1 


(p-a -1 
35. Integral does not converge. 


Y | dwdzdy dx 
ak ye" 


a*—x?-y? 


37. (a) Hint: Break up the integral into a sum of integrals 
from 0 to 1 and from 1 to oo. Show convergence of 
the improper integral from 1 to oo by comparing it to 


oe e «dx. 
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(b) Hint: Begin with ffe e™™ =? dA and use laws of ex- 
ponents. 


(c) x(1 — e=’) 
(d) x 
(e) Jr 
39. (b) 35 
41. C =ab/4 
43. (a) C= sp 
(b) 5 
45. 1—e 4 ~ 0.2212 


Chapter 6 
Section 6.1 
1. (a) 50 (b) 4 
3. (35/35 — 174/17)/27 
5. 54/17 
7. (59/7 + 87)/12 
9.2 11. ¢ 13. Be — 5)/3 
15. 4a + 1627/3 
17. 3x 19. l1-e* 


21. (a) EF -ds= 5, fF -ds = —3 


(b) y(t) = x(1 — 2t). Thus, the images of x and y are the 
same, although y traces the image in the opposite di- 
rection to that of x. For these reasons, the results of 
part (a) could have been anticipated. 


23. 0 
25. — 187 27. 9 29. 0 
cpe 


3 
33. Hint: Use formula (3). 


35. (a) 2500/13 ft-lb 


(b) 7500 ft-lb 
43. (a) 7.65625 (b) 7.5 


Section 6.2 
1. $5) M dx +N dy = ffn(Nx — My) dA = 8x 
3. fın M dx +N dy = ffh(Nx — My)dA = —4 
5. fın M dx + N dy = ffp(Nx — My) dA = -144/27 


7. (a) 0 
(b) You will need to compute four separate line 
integrals—one for each edge of the square. 
9. —2 
11. Hint: Calculate 5 fc —y dx + x dy, where C is the bound- 
ary of R, oriented counterclockwise. 


O) 35 
19. Hint: Parametrize each line segment of the perimeter. 
21. —127 
23. (a) Hint: Show V -F = 0. 
25. Directly apply Green’s theorem. 


27. The line integral is +3 times the area of the rectangle, 
where the sign depends on the orientation of the boundary. 
29. Hint: uVv = (udv/dx,udv/dy). Then use Green’s 
theorem. 
af 
on 


31. Hint: Begin with f ds and then use Green’s 
aD 


theorem. 


Section 6.3 


1. (a) 3 
(b) > 
(c) No. Line integrals are not path-independent. 
3. Not conservative 
5. Not conservative 
7. Conservative; f(x, y) = xe” + cosxy 
9. Conservative; f(x, y) = 3x7y? — x? + iy? 

11. Conservative; f(x, y, z) = 2x?yz3 — x?y + yz 

13. Conservative; f(x, y, z) = x? + xy + sin yz 

15. Conservative; f(x, y, z) = e* sin y +23 + 2z 

17. Not conservative 

21. N(x, y) = se +x3e? + u(y), where u is any function 

of y of class C?. 

23. N(x, y, z) = {xf + z7 + A(y), h of class C!. 

25. (a) Check that V x F = 0. Since F is of class C! on all 
of R?, by Theorem 3.5, F is conservative. A scalar 
potential is f(x, y, z) = ix? + sin y sinz. 

(b) 7 + sine sine? — (sin 1} 

27. 0 

29. 6 

31. —480 

33. (a) Conservative on {(x, y) | y > 0} and on 


{(x, y) | y < 0} 


b) f(x, y) = y +x? + D/Qy’) 
(œ) 1 


35. (a) F is conservative. (b) —2 
37. Work = GMm (a7 — —) 
Ixill lxo] 


True/False Exercises for Chapter 6 


1. True 

3. False. (It’s negative.) 

5. False. (The integral is 0.) 

7. False. (There is equality only up to sign.) 

9. True 

11. True 

13. True 

15. False. (The line integral could be + ie || F || ds, depending 
on whether F points in the same or the opposite direction 
as C.) 

17. False. (Let F = yi — xj and consider Green’s theorem.) 

19. False. (Under appropriate conditions, the integral is 
f(B) — f(A).) 

21. True 

23. False. (For the vector field to be conservative, the line inte- 
gral must be zero for all closed curves, not just a particular 
one.) 

25. False. (The vector field (e* cos y sin z, e* sin y sin z, 
e* cos y cos z) is not conservative.) 

27. False. (The domain is not simply-connected.) 

29. False. (f is only defined up to at most a constant.) 


Miscellaneous Exercises for Chapter 6 


1. Break up C into n segments, each of length Asg. By 
continuity, f will be nearly constant on each segment, 
so [fave © dopey Se ASK/ Dip, Ase. (Here ec; is any 
point in the kth segment.) The formula follows after taking 
limits as all As; — 0. 


3. 2a/x 
5.2 

z — j — ( 8 v2x-2V2 
aa 
9. (a) = b) 2 


7169 17-1 
11. k = p> 


r. = _ / 169v17-1 
* 1190./17—70 


13. ,=27V11, r = %2 


15. (a) S? g(f(@)cos4, FO) sin) /G OV + (FO) d0 
(6) VTO [ce — 1)/9 + 12(1 — e'®™)/37] 


17. 6x 
19. K =2n 
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21. g— 
23. Hint: Use the formula Area = $ Sap —ydx+xdy. 
25. Hint: ¥ = ff, x dA/area of D, 


J = {fp y dA/area of D. Now apply Green’s theorem to 


cos 1 — sin 1 


fap x° dy and $p xy dy. 
27.x=y=-H 
29. Hint: Use the result of Exercise 28 twice. 
35. (a) Both the line integral and the double integral are zero. 
(b) No. The double integral is not defined properly over 
the disk because F is undefined at the origin. 


(c) Shk G T Ere )]dA = 0 = fe F -ds + 
fe, F - ds. Since $c, F-ds = —2z, the result follows. 
37. (a) 0 
(b) Apply the divergence theorem. 
39. f F-ds= f —VV -ds = —V(B)+ V(A). Now use 
Exercise 38. 


Chapter 7 


Section 7.1 
1. (@) —i— 4j +2k 
(b) x+4y -2z =5 
3. y+4z=4 
5. (a) 


(b) Yes 
(c) 4x — 2y =z =3 
7. (a) Smooth except at (0, 0, 0). Tangent plane at (1, v3, 4) 
has equation 2x + 2/3y — z = 4. 
(b) S is a paraboloid. 


(C) z= +y 
(d) Yes it does. At (0, 0, 0), the tangent plane has equation 
z=0. 


9. Hint: Consider x? + y?. 
11. (a)-(c) All versions give the equation 


—x+y+VJ2z= l. 
13. X(s, t) = (2 cos s, t, 2sins), 0 < s <22,-l1<t<3 


15. Xı(s, t) = (s, t, VS? +t + T), 
Xo(s, t) = (s. t,—Vs2 +t? + 1), (s,the R? 


17. (a) z= ts? = x? 
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21. 


23. 
25. 
27. 
29. 
31. 
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(b) No 
(c) 


(d) Points on the positive z-axis 

(e) —4x +8y +z =4 

(£) (0, 0, 1) = X(+1, 0); tangent planes have equations 
xty=0. 

(a) At the point (a, g(a, c), c), the tangent plane has equa- 
a eu c)(x — a) — (y — g(a, c)) + 8:(a, c) — 
c) =U. 

(b) At the point (A(b, c), b, c), the tangent plane has equa- 
tion —(x — h(b, c)) + hy(b, ey — b) + h: (b, c)(z — 
c)=0. 

{xe R? |x =(1,0,1)+(s — 1, 2, 0) + 

(¢ + 1)(0, 1, —2)} or x =s, y=2s+t-—1, z= —2t 

1. To verify consistency with Exercise 5(b), check that 
the points given by the parametric equations all lie in the 

plane determined by the equation in Exercise 5(b). 


Jon 

A4naV/a? — b? 
(653/2 — 1739/2) /24 
JVI +a . area of D 
16a? 


Section 7.2 


11. 
13. 
15. 
17. 
19. 
21. 


. (a) 4rat 


. (a) Parametrize the cylinder as x = 2 cost, y = 2 sint, 


ON Ww So 


. 2673/3 


1 


640 
3 


(b) 4rat/3 


z=s with —2 <s <2, 0 <t < 2r. The integral 
evaluates to —647 . 

(b) The integral is —4 - (surface area of S) = —64z. 

0 

2977/2 

367 

0 

2ra? /3 


—ra? 


23. 
25. 
27. 


29. 


8x4 
13/12 —e?/12 + e/2 
(a) 


O) m = (x/(V22), y/(V/22), -1/V2), m = @, y, 0) 
(c) —14567/3 
(a) 


(b) (a coss, a sin s, 0) 
(c) N(s, 0) = (a cos s (2 cos(s/2) + sin(s/2)), 
a sin s (2 cos(s/2) + sin(s/2)), 
a(2 sin(s /2) — cos(s/2))). 
From this, we see that N(0, 0) = (2a, 0, —a), while 
N(2z, 0) = (—2a, 0, a). Therefore, the Klein bottle 
cannot be orientable, since the normal vector along 
the s-coordinate curve at t = 0 changes direction. 


Section 7.3 


1. 
5: 
9. 
11. 
13. 


15. 
17. 
25. 


0 

0 

4m (b? — a”) 

451 

(a) Hint: Use the double angle formula. 

b) -# 

1/2 

6257/8 

(a) ma 

(b) za 

(c) The answers in parts (a) and (b) are the same; the 
three flat quarter-circles that are part of dD do not 
contribute anything to ff; Vf -ndS. 


to 
oo 


29. 


(a) ff; F-ds 
x F;(r,0,z + Az/2)r A9 Ar 
— Fir, 0, z — Az/2)r A0 Ar 
+ F,(r + Ar/2, 6, zr + Ar/2)A0 Az 
— F,(r — Ar/2, 6, z)(r — Ar/2)A@ Az 
+ Fo(r, 0 + A@/2, z)Ar Az 
— Fo(r,6 — A@/2, z)Ar Az 


Section 7.4 


T, 


3; 


Hint: Use Gauss’s theorem and the product rule for 

V (f V8). 

(a) Hint: Use Green’s first formula with f = g. 

(b) Hint: Use part (a) and the fact that Vf -Vf= 
IV fll’. 


. Hint: Use the argument in Exercise 6 and the product rule 


for V -(kKVT). 


. (a) Hint: Use Gauss’s theorem and Exercise 8. 


(b) Heat flows into D from the inner sphere and out 
through the outer sphere at the same rate. 


. Hint: Use Ampére’s and Gauss’s laws. 
. (b) In each case k = oeo. 
. Hint: Apply Gauss’s theorem to P = E x B, then Fara- 


day’s and Ampére’s laws. 


. Hint: Apply the arguments used in Exercise 20 to each 


component integral of B. 


True/False Exercises for Chapter 7 


. True 

. True. (Let u = s? and v = tant.) 

. False. (The limits of integration are not correct.) 

. False. (The value of the integral is 24.) 

. True 

. False. (The integral has value 3277.) 

. False. (The value is 0.) 

. False. (The surface must be connected.) 

. True. (The result follows from Stokes’s theorem.) 
. True. (Use Gauss’s theorem.) 

. False. (Gauss’s theorem implies that the integral is at most 


twice the surface area.) 


. True 

. False. (Should be the flux of the curl of F.) 

. True. (Apply Green’s first formula.) 

. False. (f is determined up to addition of a harmonic 


function.) 


Miscellaneous Exercises for Chapter 7 


1. 


(a) C 
(d) D 


(b) E 
(e) F 


(c) A 
(f) B 
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Answers to Selected Exercises 


(a) (Vz? + 1 cos0, vz? + 1sin9, z) 
(b) (avs? + 1 cost, bys? + 1 sint, cs) 


(a) (asing cos0, bsing sin@, c cosg) 


an px |b? sinf y cos? 6 + a2c? sin’ o sin? 6 
O SE / ? ý dy do 


+a’b* cos g sin? o 


. (a) (scost, f(s), s sint) 


(b) 2 


. (a/2, a/2, a/2) 
. (0, 0, a/3) 
. (a) 1575/2 


(b) /5/2 
(©) I, = 62 52k/5, r; = J93/35 


. (a) L = I, = 20 abd(3a? + 2b7)/3 


(b) ry = ry = y (3a? + 2b7)/6 


. Hint: Use Stokes’s theorem. 


25. Use Exercise 24. 
31. Hint: Show V - (x/||x||?) = 0 where defined. 
35. V «F Æ 0, so there is no vector potential. 
39. Hint: Calculate V x (E+ dA/dt). 
Chapter 8 
Section 8.1 
1. —2 3. 6 5. 182 
7. —370 
9. —6a,b3 + 6a3b; + a2b4 — a4b2 
11. 14cosx — 7sinz + lly? +33 
13. 6(e* cos y + (y? + 2)e”*) 
15. 2xydx Ady Adz 
17. 3x3(x3 — x4) dx, A dx. A dx3 A dx4 
19. (xj e™S — x1x2x3 cos x5) dx, A dx A dx3 ^ dx4 A dxs 


Section 8.2 


I: 


. (a) Uxa (Tu, Tu, Ta) = u1 >0 for 


Hint: Show linear independence of Ty,, To,, To, by 
solving the vector equation cıT, + c2T», + c3To, = 0 
for c1, C2, C3. 

X: [0, 2m) x [0, 27) x [0, 27) x [1,2] x [1,2] > R£, 
X(81, 62, 03, l2, 13) = (x1, Y1, X2, Y2, X3, y3), Where xı = 
3cos6|, yı =3sinO,, x2 = 3cosı +l cos, y2 = 
3sinĝı + h sinO,, x3 = 3 cos 1 + l cos@ + 13 cos 63, 
y3 = 3 sin 01 + l sin + b; sin 63 

—2r 

4r? 

0< ui < v5 


which is where the parametrization is smooth. 
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Answers to Selected Exercises 


(b) Parametrize 0M in two pieces as 
Y,:[0, /5] x [0, 277) > R, 
Yi (51, 52) = (81 cos 52, 51 sin s2, $? — 6) and 
Y>:[0, /5] x [0, 277) > R, 


Y2(s1, 52) = (S1 COS $2, S1 SIN s2, 4 — s?). 
(2s, cos s2, 2s1 sin s2, — 1) 


(c) Vi = ; 
Jas +1 
(2s, cos s2, 2s1 sin s2, 1) 
2 — 
457 +1 
13. 327 
3 
1S: 5 


Section 8.3 


1. dw =e (yz dx + xz dy + xy dz) 
3. dw = —y dx Ady 
5. dw = (x + 2yz)dx ^ dy ^ dz 


7. dæ = 2x, — x2 + x3 — --- + (—1)"*! x) dx, A dx A 
see A dXn 


9. F(x,z)= xz + Cz + Di, 


11. 


G(x, y) = xy + Cy + Dy, 
where C, Dı, D2 are arbitrary constants. 


Jom = Jy do =0 


True/False Exercises for Chapter 8 


True 3. True 
True 7. True 
True 


. False. (X(1, 1, —1) = X(1, 1, 1), so X is not one-one 


on D.) 


. False. (The agreement is only up to sign.) 

. False. (This is only true if n is even.) 

. True 

. True. (dœ would be an (n + 1)-form, and there are no 


nonzero ones on R”.) 


Miscellaneous Exercises for Chapter 8 


5. 
A 


9. 


0 

(a) Vx (Vf) =0 

(b) V-(VxF)=0 

Hint: Consider fy d(w A n). 


Index 


N, XV 
U, xv 
V (del operator), 227—228 
C, XV 
C, Xv 


A 


acceleration, 190 

normal component of, 215 

tangential component of, 215 
accumulation point, 103 
addition of vectors, see vector(s), 

addition, 2 

adjoint matrix, 61 
Ampere’s law, 516-517, 518 

static case, 516 

time-varying case, 517, 518 
angle between vectors, 21, 49 
angular momentum, 243 
angular speed, 34 
angular velocity vector, 221 
antiderivative, xvii 
Archimedes’ principle, 506 
arclength function, 205, 217 
arclength parameter, 205-207 
area 

zero, 319 
area element 

Cartesian, 364, 370 

general, 370 

polar, 364, 370 
area of a surface, 464—467 
arithmetic mean, 308 
arithmetic-geometric inequality, 308 
astroid, 219 
average value, 373-377 

of a function on a surface, 525 


B 
ball 
closed, 101 
open, 101 
basis, see vector(s), standard basis, 9 
Bernoulli, Johann, 194 
Bézier curve, 239 
biharmonic function, 187 
binormal spherical image, 242 
binormal vector, see vector(s), binormal, 
212 
BMI, 155 
body mass index, 155 
boundary 
of a manifold, 541 
boundary of a set, 102 
Brahe, Tycho, 193 
budget hyperplane, 56 
buoyant force, 506 


C 


Cauchy—Riemann equations, 242 
Cauchy—Schwarz inequality, 50, 293 
Cavalieri’s principle, 310 
center of mass 
continuous 
in R, 380 
in R?, 380 
in R?, 382 
continuous, 379-383 
discrete, 377—379 
ofa wire, 451 
centroid, 383 
chain rule, 142-153 
in one variable, 143—144 
in several variables, 144—153 
change of variables 
in double integrals, 357, 370 
in triple integrals, 365, 371 
charge density, 514 
charge distribution, 514 
C™, 138 
circular helix, 190 
circulation, 413, 476 
circulation density, 498 
C*, 138 
Clairaut, Alexis, 137 
class C™, 138 
class C*, 138 
closed box, 337 
closed rectangle, 310, 314 
closed set, 102 
clothoid, 241 
Cobb-Douglas production function, 302 
codomain of a function, 82 
cofactor, 57 
column vector, 52 
commodity bundle, 56 
compact set, 271 
component functions, 85 
conduction current density, 517 
conductivity, 522 
connected, 441 
conservative vector field, 297, 441 
constraint, 279 
constraint equation, 278 
continuity 
piecewise, 318 
continuity equation, 516, 520 
for current densities, 520 
in fluid dynamics, 520 
contour curves, 87 
definition, 88 
coordinate axes 
in R?, xv 
in R?, xv 
coordinate curves 


of a manifold, 538 
coordinate transformations, 349-355 
coordinates 
Cartesian, 62 
conversions 
between Cartesian and cylindrical, 
65 
between Cartesian and polar, 64 
between Cartesian and spherical, 
68 
between cylindrical and spherical, 
68 
cylindrical, 65 
hyperspherical, 72—73 
on R?, xv 
on R?, xv 
on the real line, xv 
polar, 62 
rectangular, see coordinates, 
Cartesian, 62 
spherical, 66 
Copernicus, Nicholas, 193 
critical point, 182, 265 
constrained, 279 
degenerate, 268, 288 
cross product, 27—36 
and determinant, 31 
applications, 32-35 
definition, 28 
in R”, 61 
properties, 29 
curl, 227, 229-231, 497 
in cylindrical coordinates, 233, 508 
in spherical coordinates, 235, 509 
current density, 514 
conduction, 517 
displacement, 517 
curvature, 209-210, 216, 217 
total, 452 
radius of, 240 
curve, 419 
convex, 452 
Bézier, 239 
closed, 419 
simple, 419 
cusp, 80 
cycloid 
curtate, 17 
prolate, 18 


D 


Darboux formulas, 221 

Darboux rotation vector, 221 

definite integral, see integral, definite, 
xvi 

degree of a polynomial, 141 
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degree of a term, 141 
del operator (V), 227-228 
derivative, xv, 124, 127-128 
directional, 160, 161 
maximization of, 163 
minimization of, 163 
of a vector field, 236 
exterior, see exterior derivative, 553 
linearity of, 134 
normal, see normal derivative, 453, 
507, 520 
partial derivative 
mixed, 136 
of higher order, 136-138 


partial derivative of f with respect to 


x;, 117 
determinant, 30, 56 
properties, 60 
differentiability 
and continuity, 122, 127 
definition, 121 
in general, 126 
of a function of three or more 
variables, 123—127 
of a function of two variables, 
120-123 
differential geometry, 202 
differential operators, 152 
differentiating under the integral sign, 
405 
direction cosines, 26 
directional derivative, see derivative, 
directional, 160 
displacement current density, 517 
displacement vector, 5 
distance 
between parallel planes, 46 
between point and line, 45—46, 77 
between point and plane, 77 
between skew lines, 46-47 
divergence, 228-229, 496 
in cylindrical coordinates, 233, 
507 
in spherical coordinates, 235, 508 
divergence theorem, see Gauss’s 
theorem, 493 
in the plane, 432 
domain of a function, 82 
dot product, 18-25, 36 
definition, 19 
properties, 19 
double integral, 316 
in polar coordinates, 360-362 


E 


eigenvalue, 309 
eigenvector, 309 
electric field 
of a continuous charge distribution, 
514 
of a single point charge, 512 


electromotive force, 517 
elementary region 
in space, 340-343 
definition, 340 
in the plane, 321-327 
definition, 321 
ellipsoid, 94 
parametrized, 468, 525 
elliptic cone, 94 
elliptic paraboloid, 94 
endowment vector, 56 
energy 
kinetic, 297 
potential, 297 
epicycles, 193 
epicycloid, 80, 437 
epitrochoid, 80 
equation of continuity, see continuity 
equation, 520 
equation of first variation, 227 
equilibrium point, 298 
stable, 298 
unstable, 298 
equipotential line, 224 
equipotential set, 224 
equipotential surface, 224 
Euler’s formula, 188 
evolute, 240 
extension (f**'), 322, 343 
exterior derivative, 553 
of a k-form, 553 
of a 0-form, 553 
exterior product, 534-535 
properties of, 535 
extrema 
absolute, 264 
constrained local, 288 
global, 264, 270-274 
local, 263 
second derivative test, 268 
extreme value theorem, 271 


F 
Faraday’s law, 517-518 
Fenchel’s theorem, 452 
field 
scalar, see scalar field, 222 
vector, see vector field, 221 
first variation 
equation of, see equation of first 
variation, 227 
flow line, 225 
flow of a vector field, 227 
flux, 432, 476 
flux density, 498 
force 
buoyant, 506 
Frenet—Serret formulas, 217—219 
Fresnel integrals, 241 
frustum, 526 
Fubini’s theorem, 314, 319, 339 
functions, 82—94 


average value, 374 
along a curve, 451 
codomain, 82 
component, see component functions, 
85 
continuous, 109 
algebraic properties, 111 
domain, 82 
extension of (f'), 322, 343 
homogeneous, 187, 309 
injective, 83 
linear, 55 
mean value of, 374 
of more than one variable, 84—94 
graphing, 87—94 
quadric surfaces, see quadric 
surfaces, 93 
one-one, see functions, injective, 83 
onto, see functions, surjective, 83 
partial, 116 
polynomial, 106 
potential, 223, 441 
range, 83 
scalar-valued, 84 
smooth, 138 
surjective, 83 
fundamental theorem of calculus, 
xvii 


G 
gauge freedom, 528 
Gauss’s law, 512—514, 518 
differential form, 514 
integral form, 514 
Gauss’s theorem, 433, 493—496 
implied by generalized Stokes’s 
theorem, 558—559 
proof of, 503-505 
generalized Stokes’s theorem, see 
Stokes’s theorem, generalized, 
553 
geometric mean, 308 
global extrema, 264, 270-274 
on compact regions, 270-274 
gradient, 124, 158, 227-228 
in cylindrical coordinates, 233 
in spherical coordinates, 235 
gradient field, 223 
gravitational potential, 388 
Green’s first formula, 510 
Green’s first identity, 453 
Green’s formulas, 510 
Green’s second formula, 510 
Green’s second identity, 453 
Green’s theorem, 429 
implied by generalized Stokes’s 
theorem, 557 
proof of, 433-436 
vector reformulation of, 431 
Green’s third formula, 510 
gyration, radius of, 385, 451, 526 


H 
harmonic function, 142, 243, 453, 511, 
520 
head-to-tail addition, 4 
heat equation, 142, 520, 521 
uniqueness of solutions to, 521 
heat flux density, 489, 520 
helicoid, 142, 468 
helix, 85, 190 
Hessian, 255 
Hessian criterion 
for constrained extrema, 286—290 
for extrema, 265—270 
Hilbert matrix, 79 
homogeneous function, 187, 309 
hyperbolic paraboloid, 90, 94 
hyperboloid 
parametrized, 524 
hyperboloid of one sheet, 94 
hyperboloid of two sheets, 94—95 
hyperplane 
budget, 56 
hypersphere, 73, 167, 293, 308 
hyperspherical coordinates, 72—73 
hypersurface, 73, 124 
hypocycloid, 80 
hypotrochoid, 80 


l 
ideal gas law, 96, 158 
identity matrix, 59, 177 
implicit function theorem, 168—170 
general case, 170 
improper integral, 405—406 
incremental change, 249 
injectivity, 83 
inner product, see dot product, 19 
in R”, 49 
integrability, 316, 339 
integral 
definite, xvi 
double, 316 
improper, 405 
in polar coordinates, 360-362 
improper, 405—406 
iterated, 311 
line 
scalar, see line integral, scalar, 409 
vector, see line integral, vector, 411 
of a differential form, 536—542 
of a k-form over a k-manifold, 541 
of a 2-form, 537 
properties of 
linearity, 320 
monotonicity, 320 
triple, 338 
improper, 405, 522 
in cylindrical coordinates, 366-368 
in spherical coordinates, 368—370 
integration by parts for differential 
forms, 562 


intersection of sets, xv 
inverse function theorem, 172 
inverse matrix, 61 

involute, 15, 240 

isobars, 224 

isolated point, 103 

isoquants, 302 

isotherms, 224 

iterated integral, 311 


J 

Jacobi identity, 39, 78 
Jacobian, 172, 356, 365 
judo, 6-7 


K 
k-form 

basic, 533 

general, 533 
Kepler, Johannes, 193 
kinetic energy, 297 
Klein bottle, 489 


L 


Lagrange multipliers, 280-290 
lamina, 380 
Laplace’s equation, 141, 453 
Laplacian operator, 157, 187, 236 
inversion formula for, 511 
law of conservation of energy, 454 
least squares approximation, 
293-297 
Leibniz’s rule, 405 
length, 204 
level curves, 87 
definition, 88 
limit(s), 97-111 
algebraic properties, 106 
geometric interpretation, 103—104 
intuitive definition, 98 
rigorous definition, 99—100 
uniqueness, 106 
line 
parametric equations of, 
see parametric equations, of a 
line, 13 
line integral 
differential form, 415 
numerical approximation of, 421—426 
path-independent, 439, 440 
scalar, 409 
vector, 411 
line segment, 76 
linear combination, 543 
linear independence, 284, 539 
linear mapping, see mapping, linear, 55 
linear regression, 293 
linear span, 543 
lines 
skew, 46 
Lorentz force, 428 
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M 


magnetic field 
of a moving charge distribution, 515 
of a moving point charge, 514 
magnetic monopoles, 516, 518 
manifold, 202, 538 
underlying, 538 
boundary of a, 541 
general, 551 
smooth, 539 
mapping, linear, 55 
matrices, 30-32, 51-58 
adjoint, 61 
cofactor, 57 
cofactor expansion, 57 
determinant, 56 
elementary row operations, 60 
Hilbert, 79 
identity, 59, 177 
inverse, 61 
invertible, 61, 177 
matrix product, 53 
properties, 54 
minor, 57 
nilpotent, 79 
nonsingular, 61 
scalar multiplication, 53 
properties, 53 
symmetric, 267, 309 
transpose, 54 
triangular, 60 
matrix of partial derivatives, 125 
maximum, see extrema, 263 
Maxwell’s equations, 512-518 
mean value theorem, 329 
for double integrals, 399, 526 
for triple integrals, 497 
general version, 260 
mean value theorem for integrals, 260 
method of least squares, 293 
minimal surface, 142 
minimum, see extrema, 263 
minor, 57 
Möbius strip, 480 
moment, 377 
first, 377, 525 
of inertia, 384-386, 451, 526 
second, 384-386 
total, 377 
monopoles 
magnetic, see magnetic monopoles, 
516 
moving frame, 211, 217 
multiple regression, 297 


N 

n-dimensional volume, 560 
negative definite, 267, 309 
neighborhood, 102, 122 
nephroid, 81 

net force, 8 
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Newton’s method, 176-181 
Newton, Isaac, 194 
nilpotent matrix, 79 
nonorientable, 479 
normal derivative, 453, 507, 520 
normal line 
to a plane curve, 175 
to a surface in space, 175 
normal plane, see plane, normal, 221 
normal spherical image, 242 
normalization, 23 
numerical integration 
for functions of one variable, 388-391 
for functions of two variables, 
391-400 
Monte Carlo method, 399 


O 


octant, xv 
1-form 
basic, 530 
general, 530 
one-one, see injectivity, 83 
one-sided, see nonorientable, 479 
onto, see surjectivity, 83 
open set, 102 
opposite path, 417 
orientable, 479, 543 
orientation, 420, 479, 543 
compatible, 543 
consistent, 490, 546 
induced, 490, 546 
oriented, 420, 543 
origin 
in RÊ, xv 
in R3, xv 
orthogonal, 21 
osculating plane, see plane, osculating, 
211, 220 
Ostrogradsky’s theorem, see Green’s 
theorem, 436 
Ostrogradsky, Mikhail, 436 


P 


parallel axis theorem, 407 
parallelogram, 7 
parallelogram law, 4 
parametric equations 
of a line 
symmetric form, 12 
parametric equations, 10 
of a cycloid, 14-15 
of an involute, 15 
of a line, 9-13 
in terms of a point and a direction, 
10 
in terms of two points, 12 
parametrization, 419 
parametrized surfaces, 455—467 
definition, 455 
smooth, 460 


area of, 464-467 
piecewise smooth, 463 
partial derivative, 117 
mixed, 136 
of higher order, 136-138 
partial functions, 116 
partition, xvi, 314, 338 
path, 189 
closed, 419 
endpoints, 189 
flow line, 225 
intrinsic quantities, 217 
nonintrinsic quantities, 216 
nonrectifiable, 205 
opposite, 417 
rectifiable, 205 
simple, 419 
tangent line, 191 
velocity vector, 190 
path independence, 439, 440 
permeability of free space, 514 
permittivity of free space, 512 
perpendicular, see orthogonal, 21 
perpendicular bisector, 76 
plane 
coordinate equation for, 40—43 
normal, 221 
osculating, 211, 220 
parametric equations for, 43—45 
rectifying, 221 
planetary motion, Kepler’s laws of, 
193-200 
polynomial, 106, 107 
position vector, 3 
positive definite, 267, 309 
potential, 441 
vector, see vector potential, 528 
potential energy, 297 
potential function, see functions, 
potential, 223 
potential theory, 142 
Poynting vector field, 522 
principal minors, sequence of, 268 
principal normal vector, see vector(s), 
principal normal, 211 
probability density, 406 
joint, 406 
product rule 
for scalar-valued functions, 135 
nonexistence of a general form, 135 
of a scalar-valued and a vector-valued 
function, 136 
projection of a vector, 21-24 
Pythagorean theorem, 59 


Q 

quadrant, xv 

quadratic form, 267, 309 
negative definite, 267, 309 
positive definite, 267, 309 

quadric surfaces, 93—95 


ellipsoid, 94 

elliptic cone, 94 

elliptic paraboloid, 94 

hyperbolic paraboloid, 94 

hyperboloid of one sheet, 94 

hyperboloid of two sheets, 94—95 
quotient rule 

for scalar-valued functions, 135 

nonexistence of a general form, 135 


R 
radius of curvature, 240 
radius of gyration, 385, 451, 526 
random variables 
independent, 407 
range of a function, 83 
rectifying plane, see plane, rectifying, 
221 
reparametrization, 416 
of a k-manifold, 549 
of a manifold, 543 
of a surface, 477 
orientation-preserving, 416, 418, 478, 
549 
orientation-reversing, 416, 418, 478, 
549 
reparametrization of a path, 205—207 
resultant force, 8 
Riemann sum, xvi, 315, 338 
right-hand rule, 28, 36 
right-handed system, xv 
row vector, 52 


5 
s-coordinate curve, 458 
saddle point, 265 
scalar field, 222 
scalar line integral, see line integral, 
scalar, 409 
scalar multiplication, 2, 4, 36 
in R? and R?, 2 
in R”, 49 
properties, 3 
scalar potentials, 445—447 
scalar product, see dot product, 19 
scalars, 1 
Scherk’s surface, 142 
second derivative test 
for constrained extrema, 286—290 
for extrema, 265—270 
section of a surface, 88 
set 
boundary of a, 102 
closed, 102 
compact, 271 
open, 102 
simply-connected region, 442 
Simpson’s rule 
for functions of one variable, 390 
for functions of two variables, 394 
smooth function, 138 


Snell’s law of refraction, 308 
solid angle, 527 
span, 543 
linear, 543 
speed, 190 
sphere, 92 
spiral of Cornu, 241 
Spirograph, 79 
standard basis vectors, see vector(s), 
standard basis, 9 
standard normal vector, 460 
steradians, 527 
stereographic projection, 524 
Stokes’s theorem, 431, 490-493 
generalized, 553 
and the fundamental theorem of 
calculus, 560 
implied by generalized Stokes’s 
theorem, 557-558 
proof of, 500-503 
strake, 241 
subset, xv 
surface 
minimal, 142 
of revolution, see surface of 
revolution, 525 
oriented, 481 
surface area, 464-467 
surface area element, 485 
surface integrals, 469-488 
scalar, 470, 485 
vector, 474, 486 
surface of revolution, 525 
surjectivity, 83 
symmetric matrix, 267, 309 


T 


t-coordinate curve, 458 
tangent hyperplane, 124, 166 
tangent line, to a path, 191 
tangent plane, 118, 164-168 
equation, 166 
parametric equations for, 469 
to a smooth parametrized surface, 462 
tangent spherical image, 242 
Taylor polynomial, 244, 246 
first-order, 248 
higher-order, 256 
second-order, 254 
Taylor’s theorem 
in one variable, 244—247 
in several variables 
first-order formula, 247—248 
formulas for polynomials of order 
greater than two, 256 
second-order formula, 252—254 
Lagrange’s form of the remainder, 257 
telegrapher’s equation, 522 
thermal conductivity, 520 


tofu, 263, 340 
topology, 101 
torque, 34, 243, 385 
torsion, 212-214, 217-219 
torus, 202, 459 
total differential, 249 
tractrix, 240 
transpose, 54 
trapezoidal rule 
for functions of one variable, 388 
for functions of two variables, 392 
triangle inequality, 51 
triple integral, 338 
in cylindrical coordinates, 366-368 
in spherical coordinates, 368-370 
2-form 
basic, 531 
general, 532 
two-sided, see orientable, 479 


U 


underlying manifold, 538 
underlying surface, 455 
union of sets, xv 

unit vector, 22, 23 

utility, 301 

utility function, 186 


V 
vector field, 189, 221 
conservative, 297, 441 
curl of, 229-231 
divergence of, 228—229 
gradient, 223, 297, 441 
incompressible, 229 
irrotational, 231 
radially symmetric, 453 
solenoidal, 229 
vector line integral, see line integral, 
vector, 411 
vector potential, 516, 528 
vector product, see cross product, 28 
vector projection, 21—24 
vector surface integral element, 486 
vector(s), 1 
addition, 4 
in R”, 49 
properties, 2 
algebraic notion, 1-3 
angle between, 21 
in R”, 49 
binormal, 212, 217 
cross product 
properties, 29 
cross product, 27-36 
and determinant, 31 
applications, 32-35 
definition, 28 
definition 
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in R? and R$, 1 
in R”, 49 
difference, 4 
displacement, see displacement 
vector, 5 
distance, 49 
dot product, 18-25, 36 
definition, 19 
properties, 19 
equality 
in R? and R?, 1 
in R”, 49 
geometric notion, 3—7 
gradient, 158 
inner product 
in R”, 49 
length, 19, 49 
magnitude, see vector(s), length, 19 
norm, see vector(s), length, 19 
notation, 1 
position vector, see position vector, 3 
principal normal, 211, 217 
projection of, 21—24 
scalar multiplication, 2, 4, 36 
in R? and R°, 2 
in R”, 49 
properties, 3 
standard basis, 9 
for cylindrical coordinates, 71—72 
for spherical coordinates, 71—72 
in R”, 50 
unit, 22, 23 
unit normal, outward, 432 
unit tangent, 207, 412 
zero, 2 
vectors 
standard normal, 460 
velocity, 190 
volume, 310 
zero, 339 
volume element 
Cartesian, 371 
general, 365, 371 
in cylindrical coordinates, 367, 371 
in spherical coordinates, 369, 371 


W 


wave equation, 187 

wedge product, see exterior product, 534 
Whitney umbrella, 468 

windchill, 84, 185 

work, 26, 411 


Z 


zero area, 319 
zero vector, 2 
zero volume, 339 
0-form, 530 


